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PLANT SUGAR SENSORS AN D USES THEREOF 

5 Background of the Invention 

This application relates to plant carbohydrate metabolism; in particular, to 
enzymes which transduce sugar-sensing signals, their encoding genes, and uses 
thereof. 

Sugars are said to be regulatory molecules that are capable of controlling 
10 physiology, metabolism, cell cycle, development, and gene expression. Throughout 
the higher plant life cycle, from germination to flowering to senescence, sugars affect 
growth and development. Recently, it has become apparent that sugars are 
physiological signals capable of repressing or activating plant genes that are involved 
in many essential processes, including photosynthesis, the glyoxylate cycle, 
15 respiration, starch and sucrose synthesis and degradation, nitrogen metabolism and 
storage, pathogen defense, the wounding response, cell cycle progression, 
pigmentation, and senescence (Sheen, Photosynthesis Res. 39, 427 (1994); Thomas 
and Rodriguez, Plant Physiol 106, 1235 (1994); Knight and Gray, Mol. Gen. Genet. 
242, 586 (1994); Lam et al, Plant Physiol. 106, 1347 (1994); Chen et al., Plant J. 6, 
20 625 (1994); Reynolds and Smith, Plant Mol. Biol. 29, 885 (1995); Herbers et al., 
Plant Mol. Biol. 29, 1027 (1995); Mita et al., Plant Physiol. 107, 895 (1995)). 
Studies in a variety of plant species have also shown that sugar homeostasis appears 
to be tightly regulated. Elevated sugar concentration leads to stunted growth, reduced 
photosynthesis, leaf curling, chlorosis, necrotic leaves, and anthocyanin accumulation 

2 5 (Casper et al., Plant Physiol. 79, 1 1 (1985); von Schaewen et al., EMBO J. 9, 3033 

(1990); Dickinson et al., Plant Physiol. 95, 420 (1991); Tsukaya et al., Plant Physiol. 
97, 1414 (1991); Sonnewald et al., Plant! 1, 95 (1991); Huberet and Hanson, Plant 
Physiol 99, 1449 (1992); Sonnewald et al., Plant Responses to Sugar Accumulation 
in Transgenic Tobacco Plants, pp. 246-257, In: M. A. Madore, W. J. Lucas (eds.), 

3 0 Carbon Partitioning and Source-Sink Interactions in Plants, American Society of 
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Plant Physiologists, Rockville, MD, (1995)). In addition, environmental factors such 
as elevated C0 2 and intrinsic genetic variations such as different invertase levels have 
been proposed to affect photosynthetic capacity through sugar regulation (Stitt, Plant 
Cell Environ. 14, 741 (1991); Stitt et al., Planta 183, 40 (1991); VanOosten et al. t 
5 Plant Cell Environ. 17, 913 (1994); Nie et al, Plant PhysioL 108, 975 (1995); 
Goldschmidt and Huber, Plant Physiol 99, 1443 (1992)). 

Summary of the Invention 

By manipulating the expression of a plant hexokinase protein (HXK), we have 
discovered that this protein is a sensor that mediates diverse sugar responses in plants. 

10 In particular, we have engineered transgenic plants that either: (a) express a decreased 
level of hexokinase protein due to expression of an antisense hexokinase gene and 
therefore exhibit a decreased sensitivity to sugar; or (b) express an increased level of 
hexokinase protein and therefore exhibit an increased sensitivity to sugar. Our 
discovery has broad implications for the manipulation of agricultural crops, for 

1 5 increasing crop yield and quality, and for reducing production costs. 

In general, the invention features a method for reducing the level of a plant 
hexokinase protein in a transgenic plant cell, the method involving expressing in the 
transgenic plant cell (for example, a cell from a monocot, a dicot, or a gymnosperm) 
an antisense hexokinase nucleic acid sequence. This produces transgenic plants that 

20 are less sensitive to sugar (for example, glucose, sucrose, fructose, or mannosc). 

In preferred embodiments, the antisense hexokinase nucleic acid sequence is 
encoded by a transgene integrated into the genome of the transgenic plant cell; the 
antisense hexokinase nucleic acid sequence includes a plant antisense hexokinase 
DNA sequence (for example, a sequence that is based on the AtHXKl nucleotide 

2 5 sequence of Fig. 1 F (SEQ ID NO: 3) or the AtHXK2 nucleotide sequence of Fig. 1 G 
(SEQ ID NO: 4)); and the method further includes growing a transgenic plant from 
the transgenic plant cell, whereby the level of the hexokinase protein is reduced in the 
transgenic plant. 



-2- 



WO 97/35965 



PCI7US97/04712 



In related aspects, the invention features a plant cell (for example, a plant cell 
from a monocot, dicot, or gymnosperm) expressing an antisense hexokinase nucleic 
acid sequence; and a plant expression vector including an antisense hexokinase 
nucleic acid sequence, wherein the sequence is operably linked to an expression 
5 control region. 

In yet another aspect, the invention features a method for increasing the level 
of a hexokinase protein in a transgenic plant cell, involving expressing in the 
transgenic plant cell a hexokinase nucleic acid sequence. In preferred embodiments, 
the hexokinase nucleic acid sequence is from a plant (for example, a DNA sequence 
10 that is identical to the AtHXKl nucleotide sequence of Fig. IF (SEQ ID NO: 3) or 
that is substantially identical to the AtHXK2 nucleic acid sequence of Fig. 1G (SEQ 
ID NO: 4)). This method produces transgenic plants having an increased sensitivity 
to sugar. 

In related aspects, the invention features a substantially pure plant HXK 
15 polypeptide including an amino acid sequence substantially identical to the amino 
acid sequence of AtHXKl (SEQ ID NO: 1) or AtHXK2 (SEQ ID NO: 2). In 
preferred embodiments of both of these aspects, the HXK polypeptide is obtained 
from a plant including, but not limited to, a monocot (for example, rice, corn, wheat, 
or barley), a dicot (for example, a member of the Solanaceae (for example, potatoes) 
2 0 or a member of the Cruciferae (for example, Arabidopsis, broccoli, cabbage, brussel 
sprouts, rapeseed, kale, Chinese kale, cauliflower, or horseradish)), and a 
gymnosperm. 

In yet other related aspects, the invention features a substantially pure DNA 
encoding a plant HXK polypeptide that includes an amino acid sequence substantially 
2 5 identical to the amino acid sequence of AtHXKl (SEQ ID NO: 1) or AtHXK2 (SEQ 
ID NO: 2). In preferred embodiments, the DNA includes the nucleotide sequence 
shown in Fig. IF (SEQ ID NO: 3) or includes a nucleotide sequence that is 
substantially identical to the sequence that is shown in Fig. 1G (SEQ ID NO: 4). 
Such DNAs are obtained from any plant including, but not limited to, a monocot (for 
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example, rice, corn, wheat, and barley), a dicot (for example, a member of the 
Solanaceae (for example, potatoes) or a member of the Cruciferae (for example, 
Arabidopsis, broccoli, cabbage, brussel sprouts, rapeseed, kale, Chinese kale, 
cauliflower, or horseradish)), and a gymnosperm. In other preferred embodiments, 

5 the DNAs of the invention are operably linked to a constitutive or regulated promoter. 

In yet other related aspects, the invention features a vector including any of the 
substantially pure DNAs of the invention, the vector being capable of directing 
expression of the protein encoded by the DNA in a vector-containing cell; a cell, for 
example, a prokaryotic cell (for example, an £. coli cell) or a eukaryotic cell (for 

10 example, a plant cell) which includes any of the DNAs of the invention; and a 

transgenic plant (or a cell or a seed derived from such a transgenic plant) including 
any of the DNAs of the invention integrated into the genome of the plant, wherein the 
DNA is expressed in the transgenic plant. 

In various preferred embodiments, the plant cell contains the DNA in the sense 

15 orientation and has an increased sensitivity to sugar; the plant cell contains the DNA 
in the antisense orientation and is less sensitive to sugar; and the DNA is expressed 
under the control of a constitutive promoter or regulated promoters. 

In two other aspects, the invention features a method of producing a plant 
HXK polypeptide involving: (a) providing a cell transformed with a gene encoding a 

20 polypeptide including either an amino acid sequence substantially identical to the 
amino acid sequence of AtHXKl (SEQ ID NO: 1) or an amino acid sequence 
substantially identical to the amino acid sequence of AtHXK2 (SEQ ID NO: 2) 
positioned for expression in the cell; (b) expressing the plant HXK polypeptide; and 
(c) recovering the plant HXK polypeptide. 

25 By "hexokinase" or "HXK" is meant a polypeptide that is capable of 

catalyzing the ATP-dependent conversion of hexoses to hexose«6-phosphates. 
Methods for assaying such enzymatic activities are known in the art, e.g., those 
described herein by Renz and Stitt (Planta 190, 166 (1993)). 
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By "reducing the level of a plant hexokinase protein" is meant a decrease in 
the level of that plant hexokinase protein by at least 30-50% preferably by 50-80%, 
and more preferably by 80-95% relative to the level in a control plant (for example, a 
wild-type plant). Reduction of hexokinase protein levels may be accomplished 
5 through the expression of an antisense plant hexokinase nucleotide sequence in a 
transgenic plant. Levels of plant hexokinase protein are monitored according to any 
standard technique including, but not limited to, immunoblotting (for example, as 
described herein). Alternatively, the level of a plant hexokinase protein may be 
quantified according to standard hexose phosphorylation assays (for example, those 
1 0 described herein). 

By "increasing the level of a plant hexokinase protein" is meant increasing the 
level of that plant hexokinase protein by at least 50%, preferably 100%, and more 
preferably greater than 200% relative to the level in a control plant (for example, a 
wild-type plant). Levels of plant hexokinase protein are monitored according to any 
1 5 standard technique including, but not limited to, immunoblotting (for example, as 
described herein). Alternatively, the level of a plant hexokinase protein may be 
quantified according to standard hexose phosphorylation assays (for example, those 
described herein). 

By "an antisense hexokinase sequence" is meant a nucleotide sequence that is 
2 0 complementary to a plant hexokinase messenger RNA. In general, such an antisense 
sequence will usually be at least 15 nucleotides, preferably about 15-200 nucleotides, 
and more preferably 200-2,000 nucleotides in length. The antisense sequence may be 
complementary to all or a portion of the plant hexokinase mRNA nucleotide sequence 
(for example, the AtHXKl and AtHXK2 antisense constructs described herein), and, 
25 as appreciated by those skilled in the art, the particular site or sites to which the 
antisense sequence binds as well as the length of the antisense sequence will vary, 
depending upon the degree of inhibition desired and the uniqueness of the antisense 
sequence. A transcriptional construct expressing a plant hexokinase antisense 
nucleotide sequence includes, in the direction of transcription, a promoter, the 
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sequence coding for the antisense RNA on the sense strand, and a transcriptional 
termination region. Antisense HXK sequences may be constructed and expressed as 
described herein or as described, for example, in van der Krol et al., Gene 72, 45 
(1988); Rodermel et al.. Cell 55, 673 (1988); Mol et al., FEBSLett, 268, 427 (1990); 
5 Weigel and Nilsson, Nature 377, 495 (1995); Cheung et al., Cell 82, 383 (1995); and 
U.S. Pat. No. 5,107,065. 

By "less sensitive to sugar" is meant that the developmental, physiological, or 
molecular processes that are typically regulated or controlled by internal or external 
sugar concentrations exhibit reduced responses to the presence of a sugar (for 

10 example, glucose, fructose, mannose, or sucrose). For example, a plant having 
reduced sensitivity to sugar is capable of activating an assortment of genes (for 
example, photosynthetic genes) that are normally repressed by the presence of sugar, 
or such a plant is capable of proceeding through its normal developmental pathway 
even in the presence of sugar concentrations that would otherwise thwart or prevent 

15 such development. Analysis of a plant's sensitivity to sugar is accomplished using a 
wide variety of bioassays (for example, those described herein). These assays 
include, but are not limited to, evaluating and monitoring gene expression, seed 
germination, cotyledon development (for example, cotyledon extension), cotyledon 
greening, leaf development, embryonic root development, hypocotyl elongation, 

20 anthocyanin accumulation, starch accumulation, and time needed for flowering. By 
comparing phenotypes of wild-type plants and candidate plants (for example, a plant 
expressing an antisense hexokinase gene), one is readily able to determine whether 
such a candidate transgenic plant has a reduced sensitivity to a sugar. For example, 
sugars have been found to repress the expression of both photosynthetic (for example, 

2 5 ribulose bisphosphate carboxylase small subunit and light-harvesting chlorophyll a/b 
binding protein) and non-photosynthetic (for example, cc-amylase, sucrose synthase, 
malate synthase, and asparagine synthase) genes. Thus, in plants that are less 
sensitive to sugar, the aforementioned sugar-repressible genes have a decreased, 
reduced, or attenuated level of sugar-mediated repression. 
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By "increased sensitivity" is meant that the developmental, physiological, or 
molecular processes that are typically regulated or controlled by internal or external 
sugar concentrations exhibit increased or elevated responses to the presence of a sugar 
(for example, glucose, fructose, mannose, or sucrose). For example, a plant having 
5 increased sensitivity to sugar is capable of elevating, raising, or promoting the 

activation of an assortment of genes (for example, vegetative storage proteins) that are 
normally activated by the presence of sugar. Analysis of a plant's sensitivity to sugar 
is accomplished using a wide variety of bioassays. These assays include, but are not 
limited to, evaluating and monitoring gene expression, seed germination, cotyledon 
1 0 development (for example, cotyledon extension), cotyledon greening, leaf 

development, embryonic root development, hypocotyl elongation, anthocyanin 
accumulation, starch accumulation, and time needed for flowering. By comparing 
phenotypes of wild-type plants and candidate plants (for example, a plant expressing 
at least one additional copy of hexokinase gene), one is readily able to determine 
1 5 whether such a candidate transgenic plant has an increased sensitivity to a sugar. For 
example, sugars have been found to activate the expression of genes such as nitrate 
reductase, P-amylase, sucrose synthase, and potato storage protein. Thus, in plants 
exhibiting an increased sensitivity to sugar, the aforementioned sugar-inducible genes 
have an increased, elevated, or heightened level of sugar-mediated expression. 
20 By "polypeptide" or "protein" is meant any chain of amino acids, regardless of 

length or post-translational modification (for example, glycosylation or 
phosphorylation). 

By "substantially identical to AtHXKl" is meant a plant hexokinase 
polypeptide that includes an N-terminus which is at least 50%, preferably 75%, more 
2 5 preferably 85-90%, and most preferably 95% identical to the N-terminus of AtHXKl 
(amino acids 1-61 of Fig. IB; SEQ ID NO:l). The length of comparison will 
generally be at least 15 amino acids, preferably at least 30 amino acids, more 
preferably at least 40 amino acids, and most preferably 60 amino acids. 
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By "substantially identical to AtHXK2" is meant a plant hexokinase 
polypeptide or nucleic acid sequence that exhibits at least 86%, preferably 90%, more 
preferably 95%, and most preferably 99% identity to the amino acid or nucleic acid 
sequences of AtHXK2 (Figs. IB and 1G; SEQ ID NOS: 2 and 4). 
5 Sequence identity is typically measured using sequence analysis software (for 

example, Sequence Analysis Software Package of the Genetics Computer Group 
(University of Wisconsin Biotechnology Center, 1710 University Avenue, Madison, 
WI 53705), BLAST, or PILEUP/PRETTYBOX programs). Such software matches 
similar sequences by assigning degrees of homology to various substitutions, 
1 0 deletions, substitutions, and other modifications. Conservative substitutions typically 
include substitutions within the following groups: glycine alanine; valine, isoleucine, 
leucine; aspartic acid, glutamic acid, asparagine, glutamine; serine, threonine; lysine, 
arginine; and phenylalanine, tyrosine. 

By a "substantially pure polypeptide" is meant a plant hexokinase polypeptide 
1 5 (for example, AtHXKl or AtHXK2) which has been separated from components 

which naturally accompany it. Typically, the polypeptide is substantially pure when it 
is at least 60%, by weight, free from the proteins and naturally-occurring organic 
molecules with which it is naturally associated. Preferably, the preparation is at least 
75%, more preferably at least 90%, and most preferably at least 99%, by weight, plant 
2 0 hexokinase polypeptide. A substantially pure plant hexokinase polypeptide may be 
obtained, for example, by extraction from a natural source (for example, a plant cell); 
by expression of a recombinant nucleic acid encoding a plant hexokinase polypeptide; 
or by chemically synthesizing the protein. Purity can be measured by any appropriate 
method, for example, column chromatography, polyacrylamide gel electrophoresis, or 
25 HPLC analysis. 

By "substantially pure DNA" is meant DNA that is 
free of the genes which, in the naturally-occurring genome of the organism from 
which the DNA of the invention is derived, flank the gene. The term therefore 
includes, for example, a recombinant DNA which is incorporated into a vector, into 
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an autonomously replicating plasmid or virus; or into the genomic DNA of a 
prokaryote or eukaryote; or which exists as a separate molecule (for example, a cDNA 
or a genomic or cDNA fragment produced by PCR or restriction endonuclease 
digestion) independent of other sequences. It also includes a recombinant DNA 
5 which is part of a hybrid gene encoding additional polypeptide sequence. 

By "transformed cell" is meant a cell into which (or into an ancestor of which) 
has been introduced, by means of recombinant DNA techniques, a DNA molecule 
encoding (as used herein) an HXK polypeptide (for example, AtHXKl or AtHXK2). 

By "positioned for expression" is meant that the DNA molecule is positioned 
1 0 adjacent to a DNA sequence which directs transcription and translation of the 
sequence (i.e., facilitates the production of, for example, a plant hexokinase 
polypeptide such as AtHXKl or AtHXK2, a recombinant protein, or a RNA 
molecule). 

By "promoter" is meant a minimal sequence sufficient to direct transcription. 

1 5 Included in the invention are promoter elements that are sufficient to render promoter- 
dependent gene expression controllable for cell-, tissue-, or organ-specific gene 
expression, or elements that are inducible by external signals or agents (for example, 
light-, pathogen-, wound-, or hoimone-inducible elements); such elements may be 
located in the 5' or 3' regions of the native gene. 

2 0 By "operably linked" is meant that a gene and a regulatory sequence(s) are 

connected in such a way as to permit gene expression when the appropriate molecules 
(for example, transcriptional activator proteins) are bound to the regulatory 
sequence(s). 

By "plant cell" is meant any self-propagating cell bounded by a semi- 
2 5 permeable membrane and containing a plastid. Such a cell also requires a cell wall if 
further propagation is desired. Plant cell, as used herein includes, without limitation, 
algae, cyanobacteria, seeds, suspension cultures, embryos, meristematic regions, 
callus tissue, leaves, roots, shoots, gametophytes, sporophytes, pollen, and 
microspores. 
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By "transgene" is meant any piece of DNA which is inserted by artifice into a 
cell, and becomes part of the genome of the organism which develops from that cell. 
Such a transgene may include a gene which is partly or entirely heterologous (i.e., 
foreign) to the transgenic organism, or may represent a gene homologous to an 
endogenous gene of the organism. 

By "transgenic" is meant any cell which includes a DNA sequence which is 
inserted by artifice into a cell and becomes part of the genome of the organism which 
develops from that cell. As used herein, the transgenic organisms are generally 
transgenic plants and the DNA (transgene) is inserted by artifice into the nuclear or 
plastidic genomes. 

Other features and advantages of the invention will be apparent from the 
following description of the preferred embodiments thereof, and from the claims. 

Peteilrt Description 
The drawings will first be described. 

15 Drawings 

Figs. 1 A-E are illustrations showing various aspects of the molecular 
characterization of the Arabidopsis thaliana HXK genes. Fig. 1A shows the 
functional complementation HXK catalytic activity using a yeast hxkl/hxk2 double 
mutant (designated DBY2219) with the A. thaliana HXK homologues, AtHXKl and 

20 AtHXKl, pFL61 is shown as the control vector used in these complementation 
studies. Fig. IB is a schematic illustration showing the amino acid sequence 
comparison of A. thaliana hexokinase AtHXKl (SEQ ID NO: 1), A. thaliana 
hexokinase AtHXK2 (SEQ ID NO: 2), human GLK (SEQ ID NO: 5), rat GLK (SEQ 
ID NO: 6), Saccharomyces cerevisiae HXK1 (Yeast 1) (SEQ ID NO: 7), 5. cerevisiae 

2 5 HXK2 (Yeast2) (SEQ ID NO: 8), and Kluveromyces lactis RAG5 (Yeast3) (SEQ ID 
NO: 9). The underlined regions 1 and 2 refer to the conserved phosphate 1 and 2 
regions; the underlined dashed region refers to the adenosine interaction region. 
Amino acids which are underlined with asterisks refer to the conserved sugar binding 
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domain. Sequence analysis was performed using the PILEUP/PRETTYBOX program 
set to standard parameters. Identical and similar residues are referred to as boxed and 
highlighted regions, respectively. Fig. 1C is a schematic illustration showing the map 
positions o( AtHXKI and AtHXK2 on Arabidopsis chromosomes IV and II, 
respectively. Fig. ID is a photograph of a DNA blot analysis showing that AtHXK is 
encoded by a multigene family. The blots were hybridized with a full-length cDNA 
probe of AtHXKI (designated AtHXKL shown on the left) or MHXK2 (designated 
AtftXK2 , shown on the right). Numbers indicated on the left of the blots refer to the 
molecular size markers in kilobases. Fig. IE is a photograph of a DNA blot analysis 
of A. thaliana genomic DNA which was digested with HindUl, fractionated by gel 
electrophoresis, transferred to a nylon membrane, and hybridized with the AtHXKI 
full-length cDNA probe under low stringency conditions. Fig. IF is an illustration 
showing the nucleotide sequence of AtHXKI (SEQ ID NO: 3). Fig. 1G is an 
illustration showing the nucleotide sequence of AtHXK2 (SEQ ID NO: 4). 

Figs. 2A-B are a series of illustrations showing HXK expression in 
Arabidopsis. Fig. 2A is a photograph of an RNA blot showing the expression of 
AtHXKI and AtHXK2 in leaf (rosette and cauline), stem, flower, silique, and root 
tissues. Fig. 2B is a photograph of an RNA blot showing that the expression of 
AtHXK] and AtHXK2 is induced by light and sugar. 

Figs. 3A-F are color photographs of Arabidopsis seedlings illustrating HXK 
as a sugar sensor in plants. Fig. 3A is a photograph of transgenic Arabidopsis 
seedlings which were germinated on 1/2 MS plates containing 6% glucose and which 
have increased expression of either sense- AtHXKI (left) or anti-AtHXKl (middle) 
constructs. A wild-type (control) plant is shown on the right. Fig. 3B is a photograph 
of transgenic Arabidopsis seedlings which were germinated on 1/2 MS plates 
containing 0.8mM 2-dGlc and which have increased expression of cither sense- 
AtHXKl (left) or anti-AtHXKl (middle) constructs. A wild-type (control) plant is 
shown on the right. Fig. 3C is a photograph showing a T3 homozygous population of 
sense-AtHXKJ (left) and anti-AtHXKl (right) plants which were germinated on 1/2 
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MS plates containing 6% glucose. Fig. 3D is a photograph showing a T3 
homozygous population of sense- AtHXKl (left) and anti-AtHXKl (right) plants which 
were germinated on 1/2 MS plates containing 0.8mM 2-dGlc. Fig. 3E is a 
photograph showing sense-AtHXKl (left), anti-AtHXKl (middle), and control (right) 
5 plants which were germinated on 1/2 MS plates containing 6% mannitol. Fig. 3F is a 
photograph showing sense-AtHXKl (left), anti-AtHXKl (middle), and control (right) 
plants which were germinated on 1/2 MS plates containing 6% 3-MeGlc. 

Figs. 4A-C are a series of color photographs showing that AtHXK mediates 
sugar effects on seedling growth and development. Seedlings were grown in the dark 
10 for 6 days followed by illumination for 12 hours, on media containing various glucose 
concentrations (shown as increasing concentrations of glucose (Glc%) 2, 3, 4, 5, or 
6%). Fig. 4A is a photograph showing that higher concentrations of glucose inhibit 
hypocotyl elongation and expansion as well as greening of cotyledons in wild-type 
(control) plants. Fig. 4B is a photograph of sense-AtHXKl plants that are 
1 5 hypersensitive to glucose as indicated by the strong inhibitory effects on seedling 

development. Fig. 4C is a photograph of anti-AtHXKl plants that are less sensitive to 
glucose as shown by decreased inhibitory effects on seedling development when 
compared to wild-type plants. 

Figs. 5A-F are a series of illustrations showing the expression of various 
2 0 genes in control, sense-, and anti-sense plants. Fig. 5A is a photograph of an RNA 
blot analysis using illuminated etiolated seedlings which were germinated on 1/2 MS 
plates containing 6% glucose. UBQ expression was monitored as a control. Fig. 5B 
is a photograph of an RNA blot analysis of RNA prepared from light-grown green 
plants (Light) which were propagated without exogenous sugars, and from light- 
25 grown green plants (Dark) which were dark adapted for 3 days and then illuminated 
for 4 hours. Fig. 5C is a photograph of several RNA blots showing the expression of 
sense and antisense constructs in the transgenic plants. RNAs were extracted from 
etiolated seedlings grown on 1/2 MS plates containing 6% glucose. Gene- and strand- 
specific probes were used to reveal sense-AtHXKl (sense-1), sense-AtHXK2 (sense-2), 
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antisense-AtHXKl (anti-1), and antisense-AtHXK2 (anti-2) transcripts. The blot that 
shows sensc-AtHXKl (sense-1) expression was exposed for a longer period of time 
than the other blots. Wild-type plants were used as controls. Fig. 5D is a photograph 
of a protein blot analysis showing the expression of AtHXKl . Sense-AtHXKl plants 
5 showed elevated expression of AtHXKl, and anti-AtHXKJ plants showed reduced 
expression. Wild-type plants were used as controls. 

Fig, 5E is a bar graph showing the total hexose phosphorylation activities in etiolated 
seedlings. Fig. 5F is a bar graph showing the total hexose phosphorylation activities 
in light grown plants. Error bars show standard deviations. 

1 0 Figs. 6A-C are a series of illustrations showing that sugar signaling is 

uncoupled from sugar metabolism. Shown in Fig. 6A are photographs demonstrating 
that the growth of the wild-type strain, but not the double mutant hxkl/hxk2, is 
inhibited on a 2-dGlc/raffinose plate (left). Increased expression of either AtHXKl or 
AtHXK2 in the hxkl/hxk2 strain did not restore glucose repression, as shown by a 

1 5 level of growth for this strain on a 2-dGlc/raffinose plate which was similar to the 
double mutant transformed with vector (pFL61 ) alone (right). Fig. 6B is a color 
photograph showing the dominant interfering effect of increasing the expression yeast 
HXK2 in transgenic Arabidopsis seedlings (i.e., YHXK2 plants) which were grown for 
7 days on 1/2 MS plates containing 6% glucose. Fig. 6C is a bar graph showing total 

2 0 hexose phosphorylation activities in etiolated or green YHXK2, sense-AtHXKU and 
control plants. Error bars represent standard deviations. 

There now follows a description of the cloning and characterization of two 
Arabidopsis HXXT-encoding cDNAs which are useful in the instant invention, and a 
characterization of their ability to regulate sugar metabolism. These examples are 

2 5 provided for the purpose of illustrating the invention, and should not be construed as 
limiting. 
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Molecular Characteriza tion of Arabidopsis HXK Genes 

To elucidate the role of HXK as a sugar sensor, the Arabidopsis HXK genes 
were cloned by functional complementation using a yeast Saccharomyces cerevisiae 
hxkl/hxk2 double mutant (designated DBY2219), a strain which lacked HXK activity 
(Ma and Botstein, Mol Cell Biol 6, 4046 (1986)). Using this approach, we 
identified two cDNAs, designated AtHXKl (Fig. IF; SEQ ID NO: 3, GenBank 
accession no. U28214) and AtHXKl (Fig. 1G; SEQ ID NO: 4, GenBank accession no. 
U28215). These cDNAs, which were 2.0 and 1.9 kb in length respectively, were both 
found reproducibly to complement the yeast double mutant and to allow its growth on 
a selection plate containing fructose as the sole carbon source. Exemplary results are 
shown in Fig. 1 A; mutant yeast cells transformed with either the AtHXKl or AtHXKl 
cDNAs were capable of growth on the selection media, indicating that these genes 
complemented the double mutant. In contrast, mutants transformed with the plasmid 
vector pFL61 alone were incapable of growth on the same selection media (Fig. 1 A) 
(Minet et ah, Plant 1 2, 417 (1992)). 

DNA sequence analyses of AtHXKl (Fig. IF, SEQ ID NO: 3) and AtHXKl 
(Fig. IF, SEQ ID NO: 4) predicted open reading frames of 496 and 502 amino acids, 
respectively (Fig. IB). These genes were found to share 82% nucleotide identity and 
85% amino acid identity. In addition, database searches and sequence comparisons 
revealed that these AtHXKs shared between 34-35% sequence identity with the human 
and rat GLKs (Nishi et al., Diabetologia 35, 743 (1992); Magnuson et al., Proc. Natl 
Acad. Sci. U.S.A. 86, 4838 (1989)), and between 36-38% sequence identity with 
several yeast HXKs (Stachelek et al., Nucl Acids Res. 14, 945 (1986); Prior et al., 
Mol Cell Biol 13, 3882 (1993)). Conserved ATP- and sugar-binding domains were 
also identified in the predicted amino acid sequences of both AtHXK genes. As 
shown in Fig. IB, three domains were identified which are involved in ATP binding 
(Bork et al., Protein Sci. 2, 31 (1993)). Also shown in Fig. IB is a sugar binding 
domain which is similar to the glucose binding site found in mammalian GLK (Bork 
et al., Protein Sci. 2, 31 (1993)). In general, our sequence comparison revealed that 
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the overall sequence and structure of the Arabidopsis HXKs were similar to those of 
the mammalian GLK and yeast HXKs, but distinct from that of plant fructokinase 
(Smith et al., Plant Physiol. 102, 1043 (1 993)). 

We next determined the chromosomal positions of AtHXKl and AtHXK2 
5 genes by standard segregation analysis of restriction fragment length polymorphisms 
(RFLPs) in recombinant inbred lines (Nam et al., Plant Cell 1, 699 ( 1 989); Lister and 
Dean, Plant J. 4, 745 (1993); Hauge et al., Plant J. 3, 745 (1993); Schmidt et al., 
Science 270, 480 (1 995); Zachgo et al.. Genomic Res. 6, 19 (1996)). By this analysis, 
wc found that AtHXKl is located on chromosome 4 and is flanked by the 
1 0 chromosomal markers mi232 and g8300 (Fig. 1 C), and that AtHXK2 is located on 
chromosome 2 and is flanked by the chromosomal markers mil48 and mi238 (Fig. 
1C). 

The copy number of the AtHXK genes was determined by genomic DNA 
(Southern) blot analysis. Genomic DNA was prepared according to standard methods 

1 5 from A. thaliana (Landsberg er), digested with Bgtil, EcoRl, HindHl, or Xbal, 

fractionated by gel electrophoresis, transferred to a nylon membrane, and hybridized 
with a full-length cDNA randomly-primed probe of either AtHXKl or AtHXK2 
(Ausubel et al., infra). Genomic DNA blot analysis revealed that AtHXKl hybridized 
at high stringency with two DNA fragments which corresponded to the two AtHXK 

20 genes (Fig. ID, blot designated AtHXKl). In addition, at least one other fragment was 
visible on the same blot when AtHXK2 was used as a probe under identical 
hybridization conditions (Fig. ID, blot designated AMXK2 Y Using this same 
approach, a third cDNA (AtHXK3) was also identified, further supporting the 
hypothesis that three homologous HXK genes exist in Arabidopsis. At low stringency 

2 5 conditions, a large number of additional bands were also detected, suggesting that 
more than three genes share sequence similarity with AtHXKl (Fig. IE). 
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AtHXK Gene Expression 

To examine AtHXK gene expression, RNA blot experiments were performed 
as follows. RNA was extracted from rosette leaves, cauline leaves, stems, flowers, 
siliques, and roots according to standard methods. The extracted RNAs were gel 
fractionated and transferred to a nylon membrane (for example, as described in 
Ausubel et al., infra). Blots were subsequently hybridized with either AtHXKl, 
AtHXK2 y or ubiquitin (UBQ) probes (Greenberg et al., Cell 77, 551 (1994)) according 
to standard techniques. The UBQ probe was used as a control in these experiments. 
Equal amounts of RNA were loaded in each lane. 

RNA blot analyses indicated that both AtHXKl and AtHXK2 probes detected 
RNA bands of approximately 2 kb in length. As shown in Fig. 2 A, the transcript 
levels of both AtHXKl and AtHXK 2 were greatest in the siliques, moderate in flowers 
and rosette leaves, and lowest in the stem and cauline leaves. In roots, AtHXKl 
expression was greater than AtHXKl. The varied levels of AtHXKl and AtHXK2 
expression may reflect their diverse physiological roles, for example, feedback 
regulation of photosynthesis in source tissues (i.e., sugar providers) such as rosette 
leaves, and sugar metabolism in sink tissues (i.e., sugar acceptors) such as siliques 
and flowers. 

Since light is required for plants to produce sugars by photosynthesis, we 
investigated the effect of light on AtHXK gene expression. RNA blot analyses were 
carried out as described above using total RNA prepared from dark-grown etiolated 
and light-grown dark-adapted wild-type plants, with or without illumination 
(designated in Fig. 2B as Dark and Light, respectively). In particular, dark-grown 
etiolated seedlings were germinated and grown on plates containing 1/2 Murashige- 
Skoog (MS) medium, with or without 6% glucose (designated in Fig. 2B as +Sugar 
and -Sugar, respectively). Plants were grown in the dark for 6 days and then exposed 
to white light (120 nE m'h' 1 ) for 4 hours. Light-grown, dark-adapted plants consisted 
of fifteen-day-old light-grown green plants which were dark-adapted for 3 days and 
remained in the dark, or were illuminated and then flushed with 3% glucose 
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(designated in Fig. 2B as Light+Sugar). Growth conditions were as described by 
Cheng et al. (Proc. Natl. Acad. Sci. t U.S.A. 89, 1861 (1992)). 

As shown in Fig. 2B, both AtHXKl and AtHXK2 were found to be expressed 
at very low levels in non-photosynthetic etiolated seedlings, even after 4 hours of 
5 illumination. However, their expression was induced by the addition of exogenous 
sugar. The transcript levels of both AtHXKl and AtHXKl were low in dark-adapted 
green plants, but was induced significantly upon illumination and further enhanced by 
sugar (Fig. 2B). UBQ gene expression was found to be affected by both light and 
sugar. These results revealed that AtHXK expression was tied to the conditions in 

1 0 which sugar-sensing and metabolism were needed, indicating that plant sugar 

homeostasis was controlled by AtHXK levels through an autoregulatory mechanism. 
AtHXK as a Sugar Sensor in Plant^ 

To test the hypothesis that AtHXKs act as sugar sensors in intact plants, 
transgenic plant models were established by introducing sense and antisense genes to 

15 alter AtHXK levels. Wild-type (Bensheim) Arabidopsis plants were transformed with 
binary vectors carrying gene fusions with the CaMV35S promoter and sense AtHXKl 
(sense-AtHXKl\ sense AtHXK2 {sense-AtHXK2\ antisense AtHXKl (anti- AtHXKl), 
or antisense AtHXKl (anti-AtHXK2) using a standard Agrobacterium-medialed root 
transformation protocol (Czako et al., Mol Gen. Genet. 235, 33 (1992)). The 

2 0 presence of a transgene was determined by NPT1I expression and resultant kanamycin 
resistance, and by DNA blot analysis. Several transgenic lines of the T3 generation 
homozygous for the sense or antisense transgenes were selected for further analyses 
Sugar sensitivity of transgenic plants was examined by performing bioassays 
using 6% glucose or 0.8mM 2-deoxyglucose (2-dGlc), a nonmetabolizable glucose 

2 5 analog. On 6% glucose plates, greening and expansion of cotyledons, initiation of 
true leaves, and elongation of hypocotyl and root were suppressed in control (wild- 
type) Arabidopsis seedlings grown under constant light (Fig. 3A, right). These 
inhibitory effects caused by glucose were observed in six different Arabidopsis 
ecotypes including Bensheim (BE), C24, Columbia (Col), Landsberg erecta (Ler), 
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RLD, and Wassilewskija (WS) (data not shown). In addition, the greening of 
cotyledons was found to be inhibited at a low concentration of 2-dGlc in control 
plants (Fig. 3B, right). This phenotype was consistent with the finding that 2-dGlc 
was capable of acting as a potent sugar signal that could trigger global repression of 
genes encoding photosynthetic proteins. 

Compared to control plants, sense-AtHXKJ plants showed hypersensitivity to 
6% glucose as indicated by stunted growth of the cotyledons, hypocotyl, and root 
(Fig. 3 A, left). In contrast, anti-AtHXKl plants turned green and elongated normally 
(Fig. 3A, middle), indicating that they were less sensitive to sugar. Fig. 3C illustrates 
that sugar hypersensitivity and insensitivity were displayed homogeneously in the T3 
transgenic plant populations. As in the glucose assay, sense-AtHXKl plants were 
hypersensitive to 2-dGlc as shown by the severe inhibition of greening of cotyledons 
(Fig. 3B, left; Fig. 3D, left). Anti-AtHXKl plants were less sensitive to sugar and 
appeared green when germinated in the presence of 2-dGlc (Fig. 3B, middle; Fig. 3D, 
right). 

As shown in Table 1 (below), similar phenotypes were observed in multiple 
independent transgenic lines generated with either sense or antisensc AtHXKl or 
AtHXK2. The scored phenotypes shown in Table 1 were determined based on the 
examination of light-grown, 7-day-old seedlings which were germinated on 1/2 MS 
plates containing either 6% glucose or 0.8mM of 2-dGlc. Sugar insensitive (Ins), 
hypersensitive (Hyp), and ambiguous (A) phenotypes were scored and tabulated. The 
results of this analysis are presented in Table 1 (below). 



Table 1 

Sugar sensitivity in T3 homozygous transgenic plants. 
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'Sugars insensitivity is believed to result from eo- suppress ion. 
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To rule out the possibility that the difference in sugar sensing between 
transgenic and control plants was due to an osmotic effect, mannitol and 3-0- 
methylglucosc (3-MeGlc) were used in control experiments. No apparent difference 
was observed between control and transgenic plants when plated on 6% mannitol 
5 (Fig. 3E) or 6% 3-MeGlc, a glucose analog which is not phosphorylated by HXK 

(Fig. 3F). Together, these results indicated that sugar sensing in transgenic plants was 
specific, because neither mannitol nor 3-MeGIc were able to replace glucose and 
interact with AtHXKs. 

AtHXK Mediates Sugar Effects on Plant Gro wth and Development 
10 We next compared the effects of sugar on hypocotyl and cotyledon 

development in both wild-type and transgenic plants. For these experiments, 
Arabidopsis seedlings were grown on plates containing 2-6% glucose for six days in 
the dark. Since hypocotyl elongation occurs more in the dark, the inhibitory effect 
caused by sugar could be visually evaluated. Because light triggers cotyledon 
1 5 expansion and greening, these dark-grown seedlings were exposed to light for 12 
hours to determine the effect of sugar on cotyledon development. Our results are 
shown in Figure 4. 

In particular, in control plants, the hypocotyl length was inversely proportional 
to the glucose concentration (Fig. 4A). Under similar growth conditions, the sense- 

2 0 AtHXK plants were hypersensitive to sugar as revealed by the reduction of hypocotyl 
length when grown in the presence of 3-6% glucose (Fig. 4B). In contrast, ami- 
AtHXK plants were able to elongate even in the presence of 5 or 6% glucose (Fig. 
4C). Although glucose concentrations below 2% promoted seedling growth in the 
presence of other nutrients (data not shown), hypocotyl inhibition by glucose 

2 5 concentrations above 2% reflected sugar-sensing mediated through AtHXK. 

In contrast to light, glucose (at 4-6%) suppressed cotyledon greening and 
expansion in control plants (Fig. 4A). In sense-AtHXK plants, the impairment was 
greater as indicated by etiolated cotyledons (Fig. 4B). However, anti-AtHXK plants 
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were found to be less sensitive to all glucose concentrations and turned green 
normally (Fig. 4C). Sugar inhibition of cotyledon development is explained by the 
plant's ability to switch to heterotrophic growth in the presence of abundant external 
sugars rather than photoautotrophic growth for which cotyledon expansion and 
5 greening are required. We also observed that there was a lack of sugar inhibition in 
dark-grown roots of control and antisense seedlings (Fig. 4A and Fig. 4C). This could 
be the consequence of low AtHXK expression in roots in the dark because ectopic 
AtHXK expression conferred glucose-dependent inhibition of root growth in sense 
plants (Fig. 4B). These results indicated that distinct sugar responses could occur in 
1 0 different tissues due to differential expression of AtHXK (Fig. 2). 

AtHXK Mediates Sugar Repression and Acti vation of Gene Expression 

To determine whether HXK was involved in sugar regulation of gene 

expression, we compared the expression levels of two sugar-repressible genes, the 

light-harvesting chlorophyll a/b binding protein (CAB1) and the ribulose bisphosphate 
1 5 carboxylase small subunit (RBCS) y and one sugar-inducible gene, the nitrate reductase 

(NR1) gene in control, sense-, and anti-A tHXK plants (Sheen, Photosynthesis Res. 39, 

427 (1994); Thomas and Rodriguez, Plant Physiol. 106, 1235 (1994); Cheng et aL, 

Proc. Natl Acad. ScL U.S.A. 89, 1861 (1992)). 

We first examined illuminated dark-grown etiolated seedlings propagated on 
2 0 6% glucose as described above. RNA blot analyses were performed using 1 . 1 kb 

G45V, 0.5 kb RBCS y and 3.2 kb NRJ probes from Arabidopsis; also as described 

above. 

The transcription levels of CAB I and RBCS were low in control plants, and 
nearly abolished in both sense-AtHXKl and sense-AtHXK2 plants exhibiting sugar 
2 5 hypersensitivity (Fig. 5 A). In contrast, both genes were expressed at high levels in 

both anti-AtHXK plants indicating sugar insensitivity. Consistent with the notion that 
sense transgenic plants are hypersensitive to sugars, NR1 was activated in both sense- 
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AtHXKl and sense- AtHXK2 plants, but not in anti-AtHXK or control plants (Fig. 5 A). 
Together, these data indicated that AtHXK was the sensor which mediated both sugar- 
repressible and sugar-inducible gene expression in higher plants. In addition, the 
transcription levels of CAB J and RBCS were similar in sense-AtHXKs and control 
5 plants grown in the absence of sugars (data not shown), indicating that sugar sensing 
by the AtHXKs was specific, and that exogenous sugar was at least one signal used in 
illuminated etiolated seedlings. 

To show that AtHXK regulated gene expression under physiological 
conditions, we examined CAB J and RBCS expression in light-grown green plants 

10 without the addition of exogenous sugars (as described above). The results from these 
experiments showed that the transcript levels of both genes were nearly five-fold 
lower in sense-AtHXK plants than in anti-AtHXK ox control plants (Fig. 5B). This 
differential expression was perhaps due to the repression of light inducibility by 
endogenous sugars mediated through the increased expression of AtHXK, because 

15 both genes showed uniformly low expression in the dark in transgenic and control 
plants (Fig. 5B) (data not shown for AtHXK2). 

Altered AtHXK Expression in Transgenic Plants 

To confirm that the observed sugar hypersensitivity or insensitivity in 
transgenic plants correlated with transgene expression, RNA and protein blot analyses 

2 0 were conducted as follows. RNA blot analyses were performed using gene- and 

strand-specific probes for the sense and antisense constructs which were expressed in 
the transgenic plants. Probes were synthesized using a polymerase chain reaction 
(PCR) method described by Greenberg et al. {Cell 77, 551 (1994)). Oligonucleotides 
for use as PCR primers were designed from the sequence of AtHXKl (SEQ ID NO: 3) 

2 5 and AtHXK2 (SEQ ID NO: 4), and were used to amplify their respective cDNA 
fragments. The sense primers were 5 '- ATGGGT AAAGT AGCTGTT- 3 1 (SEQ ID 
NO: 10) and 5'-ATGGGTAAAGTGGCAGTTGCAA-3' (SEQ ID NO: 1 1) for 
AtHXKl and AtHXK2 y respectively. The antisense primers were 5'- 



-21- 



WO 97/35965 



PCT/US97/04712 



TTAAGAGTCTTCAAGGTAGAG-3' (SEQ ID NO: 12) and 5'- 
TTAACTTGTTTCAGAGTCATCTTC-3' (SEQ ID NO: 13) for AtHXKl and 
AtHXKl, respectively. Plasmids (pBluescript™ KS+) containing either AtHXKl or 
AtHXK2 full-length cDNAs were used as templates for the PCR reactions. RN As 
were extracted from illuminated etiolated seedlings grown on 6% glucose plates, gel 
fractionated, blotted to nylon membranes, and hybridized with each probe as 
described above. In illuminated etiolated seedlings, the transcript levels of AtHXKl 
(sense- 1) in sense- AtHXKl plants and AtHXKl (sense-2) in sense- AtHXK2 plants 
were more than 20- fold higher than control plants (Fig. 5C). In antisense plants, 
antisense RNAs of AtHXKl (anti-1 ) and AtHXKl (anti-2) were expressed in their 
respective antisense transgenic plants. In contrast, the endogenous transcript of 
AtHXKl in anti-AtHXKl and anti- AtHXKl plants was reduced to less than 20% of the 
control level (Fig. 5C). Longer exposure of the sense-2 blot revealed that the 
endogenous AtHXKl expression in anti-AtHXKl and anti-AtHXKl plants was also 
reduced (data not shown). These results indicated that either antisense AtHXKl or 
antisense AtHXK2 RNA was capable of reducing the endogenous RNA levels of both 
AtHXKl and AtHXKl, presumably because of the high level of sequence identity. 
Similar results were obtained when 1 5-day-old, light-grown green transgenic plants 
were analyzed (data not shown). 

Protein blot analyses were also performed with seedlings which were 
germinated and grown in the dark for 6 days on plates containing 1/2 MS medium 
with or without 6% glucose, and then exposed to white light (120 ^iE m" 2 s ! ) for 4 
hours. These analyses were also performed using protein extracted from fifteen-day- 
old light-grown green plants which were dark adapted for 3 days and then illuminated 
for 4 hours. Antibodies which were used in these experiments were prepared as 
follows. AtHXKl containing the entire open reading frame was subcloned into the 
plasmid vector pET-19b (Novagen) for ovcrcxpression in Escherichia coli according 
to standard methods. Overexpressed AtHXKl was then gel purified and used for the 
production of rabbit polyclonal antibodies. The antibodies were affinity purified, and 
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protein blot analyses were performed using the Phototope™-Star Western Blot 
Detection Kit (New England Biolabs). Protein was extracted according to 
conventional methods (Wei ct al. f Cell 78, 1994; Tots et al. EMBOJ. 6, 1843 (1987)). 
The results of the protein blot experiments showed that AtHXKl expression 
5 was 5 to 10 fold higher in sense-AtHXKl than in control plants. In anti- AtHXKl 

plants, the level of AtHXKl was significantly lower than in control plants, although it 
was not completely eliminated (Fig. 5D). 

Hexose Phosphorylation Activities in Transgenic Plants 

To determine whether the altered AtHXK expression affected the total 
10 catalytic activities of hexose phosphorylation in transgenic plants, we performed a 

series of standard hexose phosphorylation assays as described by Renz and Stitt 

(Planta 190, 166(1993)). 

In illuminated etiolated seedlings (grown as described above), sense-AtHXKl 

plants were found to possess the highest hexose phosphorylation activity, whereas 
15 other plants displayed lower activities (Fig. 5E). The higher activity detected in 

plants having increased expression of the AtHXK gene was consistent with the result 

of a yeast transformation experiment indicating that AtHXKl had higher catalytic 

activity than AtHXK2 (data not shown). 

We also carried out the enzymatic assay using fifteen-day-old, light-grown 
2 0 green plants that were dark-adapted for 3 days and illuminated for 4 hours. As shown 

in Fig. 5F, both sense-AtHXKl and sense-AtHXK2 plants had higher hexose 

phosphorylation activities than anti-AtHXK and control plants. Together, these data 

provided evidence for the conclusion that the manipulation of AtHXK expression is 

sufficient to alter sugar-sensing and sugar-regulated activities in Arabidopsis. 
2 5 Therefore, the specific interaction between sugars and HXK, for example, AtHXKl 

and AtHXK2, but not the total catalytic activity of HXK, was shown to be a key 

determinant of sugar-sensing mechanisms in plants. 
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Sugar Signaling is Uncoupled from Sugar Metabolism in Plants 

The above observations suggested the existence of a regulatory function for 
HXK, and the uncoupling of sugar signaling from metabolism in plants. To confirm 
this hypothesis, we sought to increase the expression of a heterologous HXK that 
5 would provide excess catalytic activity for sugar metabolism, but no regulatory 
function. The yeast HXK2 (YHXK2) has been proposed to have catalytic and 
regulatory functions, and appeared to be a good candidate for this experiment (Entian, 
MoL Gen. Genet 178, 633 (1980); Entian and Frshlich, J. BacterioL 158, 29 (1984); 
Entian et al., MoL Cell Biol. 5, 3035 (1985)). We first determined whether the 

10 putative regulatory functions of the YHXK2 and AtHXK were interchangeable by 

examining the effect of increasing the expression of AtHXK on glucose repression in 
a yeast hxkl/hxkl double mutant (DBY221 9). The assay was based on the YHXK2- 
mediated growth inhibition (i.e., glucose repression) of wild-type yeast cells on a 2- 
dGlc/raffinose plate. The glucose repression assay was performed using a YP plate 

15 with 2% raffinose as the carbon source in the presence of 2-deoxyglucose (0.02%) as 
described by Ma et al. {MoL Cell. Bioi 9, 5643 (1989)). The glucose analog, 2-dGIc, 
mimics glucose by inducing strong repression of the invertase gene (SUC2), but is 
itself unavailable for use as a carbon source. Thus, wild-type yeast strains exhibit 
glucose repression and cannot grow under these assay conditions; in contrast, in the 

2 0 hxkl/hxkl double mutant, SUC2 expression is derepressed, allowing raffinose 

hydrolysis and the release of fructose for growth on assay plates. As shown in Fig. 
6A, DBY2219 grew on a 2-dGlc/raffinose plate due to the lack of YHXK2 and 
derepression of invertase gene expression. However, growth of this strain was 
inhibited upon transformation with YHXK2 and restoration of glucose repression 

2 5 (Fig. 6A) (Ma and Botstein, MoL CelL BioL 6, 4046 ( 1 986); Ma et al., MoL CelL BioL 
9, 5643(1989)). 

To determine the effect of increasing the expression of YHXK2 in plants, a 
transgene construct, pCaMV35S-YHXK2, that expressed YHXK2 was introduced into 
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Arabidopsis using the Agrobacterium-medizted protocol described above. Transgenic 
plants having increased expression of YHXK2 (YHXK2 plants) were observed to 
display sugar insensitivity in many assays. For instance, YHXK2 seedlings were less 
sensitive to 6% glucose than control and sense-AtHXK plants. Hypocotyl elongation, 
5 root growth, and greening of cotyledons were found to be inhibited in sense-AtHXKl 
or control plants, but not in YHXK2 plants (Fig. 6B). RNA blot analysis showed that 
CAB1 and RBCS transcripts were not repressed in YHXK2 plants, whether grown with 
or without exogenous glucose (data not shown). 

To insure that YHXK2 provided hexose phosphorylation activity in plants, 

1 0 enzymatic assays were conducted using both etiolated and green transgenic plants. 
Fig. 6C shows that total hexose phosphorylation activity was much higher in YHXK2 
than in control plants, and similar to or higher than sense-AtHXK plants. However, 
YHXK2 plants were sugar-insensitive rather than hypersensitive (Fig. 6B). This 
dominant interfering effect of YHXK2 in transgenic plants presumably resulted from 

1 5 the increased expression i of YHXK2 which competed with AtHXK for sugars, but 
which was incapable of transmitting a signal. We ruled out the possibility of gene 
silencing effects based on the normal expression of AtHXK J and ATHXK2 in YHXK2 
plants (data not shown). 

These experiments indicated that the catalytic function of HXK was 

2 0 interchangeable between yeast and plants, but not the regulatory function for sugar- 
signaling. Our recent results have also shown that a third AtHXK does not 
complement HXK regulatory function (data not shown). Thus, glucose-signaling 
does not require extensive metabolism and is diminished when YHXK2 is 
overexpressed in plants. 

25 In sum, we have found that HXK mediates sugar-sensing in higher plants 

based on the analyses of transgenic plants with gain or loss of AtHXK function, and a 
dominant interfering YHXK2. 
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Isolation of Other HXK cDNAs an d Genomic DNAs 

Based on the isolation described herein of the aforementioned HXK genes and 
polypeptides, the isolation of additional plant HXK coding sequences is made 
possible using standard strategies and techniques that are well known in the art. For 
5 example, using all or a portion of the amino acid sequence of an HXK polypeptide, 
one may readily design //AX-specific oligonucleotide probes, including HXK 
degenerate oligonucleotide probes (i.e., a mixture of all possible coding sequences for 
a given amino acid sequence). These oligonucleotides may be based upon the 
sequence of either DNA strand and any appropriate portion of the HXK sequence (for 

10 example, Fig. 1F-G; SEQ ID NOS: 3 and 4, respectively). General methods for 
designing and preparing such probes are provided, for example, in Ausubel et al., 
1996, Current Protocols in Molecular Biology, Wiley Interscience, New York, and 
Berger and Kimmel, Guide to Molecular Cloning Techniques, 1987, Academic Press, 
New York. These oligonucleotides are useful for HXK gene isolation, either through 

15 their use as probes capable of hybridizing to HXK complementary sequences or as 
primers for various amplification techniques, for example, polymerase chain reaction 
(PCR) cloning strategies. 

Hybridization techniques and screening procedures are well known to those 
skilled in the art and are described, for example, in Ausubel et al. (supra); Berger and 

20 Kimmel (supra); and Sambrook et al, Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Laboratory Press, New York. If desired, a combination of 
different oligonucleotide probes may be used for the screening of a recombinant DNA 
library. The oligonucleotides may be detectably-labeled using methods known in the 
art and used to probe filter replicas from a recombinant DNA library. Recombinant 

25 DNA libraries are prepared according to methods well known in the art, for example, 
as described in Ausubel et al. (supra), or they may be obtained from commercial 
sources. 

For detection or isolation of closely related HXK sequences having greater 
than 80% identity, high stringency conditions are preferably used; such conditions 
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include hybridization at about 65 °C and about 50% formamide, a first wash at about 
65 °C, about 2X SSC, and 1% SDS, followed by a second wash at about 65 °C and 
about 0. 1 % SDS, and 1 X SSC. Lower stringency conditions for detecting HXK genes 
having about 30-50% sequence identity to the HXK genes described herein include, 
5 for example, hybridization at about 45 °C in the absence of formamide, a first wash at 
about 45°C, about 6X SSC, and about 1% SDS, and a second wash at about 50°C, 
about 6X SSC, and about 1% SDS. These stringency conditions are exemplary; other 
appropriate conditions may be determined by those skilled in the art. 

As discussed above, HXK oligonucleotides may also be used as primers in 

10 amplification cloning strategies, for example, using PCR. PCR methods are well 
known in the art and are described, for example, in PCR Technology, Erlich, ed., 
Stockton Press, London, 1989; PCR Protocols: A Guide to Methods and Applications, 
Innis et al., eds.. Academic Press, Inc., New York, 1990; and Ausubel et al. (supra). 
Primers are optionally designed to allow cloning of the amplified product into a 

15 suitable vector, for example, by including appropriate restriction sites at the 5' and 3' 
ends of the amplified fragment (as described herein). If desired, HXK may be isolated 
using the PCR "RACE" technique, or Rapid Amplification of cDNA Ends (see, e.g., 
Innis et al. {supra)). By this method, oligonucleotide primers based on an HXK 
sequence are oriented in the 3' and 5' directions and are used to generate overlapping 

20 PCR fragments. These overlapping 3*- and 5'-end RACE products are combined to 
produce an intact full-length cDNA. This method is described in Innis et al. (supra); 
and Frohman et al., Proc. Natl. Acad. Sci. USA 85, 8998, (1988). 

Alternatively, any plant cDNA expression library may be screened by 
functional complementation of a yeast hxkJ/hxk2 double mutant as described herein 

25 by Ma and Botstein , Mol. Cell. Biol. 6, 4046 ( 1 986)). 

Useful HXK sequences may be isolated from any appropriate organism. 
Confirmation of a sequence's relatedness to the HXK polypeptide family may be 
accomplished by a variety of conventional methods including, but not limited to, 
functional complementation assays and sequence comparison. In addition, the 
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activity of any HXK sequence may be evaluated according to any of the techniques 
described herein. 

Polypeptide Expression 

HXK polypeptides may be produced by transformation of a suitable host cell 
5 with all or part of an HXK cDNA (for example, the cDNA described above) in a 

suitable expression vehicle or with a plasmid construct engineered for increasing the 
expression of an HXK polypeptide (supra) in vivo. 

Those skilled in the field of molecular biology will understand that any of a 
wide variety of expression systems may be used to provide the recombinant protein. 
1 0 The precise host cell used is not critical to the invention. The HXK protein may be 
produced in a prokaryotic host, for example, 

£. colU or in a eukaryotic host, for example, Saccharomyces cerevisiae, mammalian 
cells (for example, COS 1 or NIH 3T3 cells), or any of a number of plant cells 
including, without limitation, algae, tree species, ornamental species, temperate fruit 

15 species, tropical fruit species, vegetable species, legume species, monocots, dicots, or 
in any plant of commercial or agricultural significance. Particular examples of 
suitable plant hosts include, but are not limited to, Conifers, Petunia, Tomato, Potato, 
Tobacco, Arabidopsis, Lettuce, Sunflower, Oilseed rape, Flax, Cotton, Sugarbeet, 
Celery, Soybean, Alfalfa, Medicago, Lotus, Vigna y Cucumber, Carrot, Eggplant, 

2 0 Cauliflower, Horseradish, Morning Glory, Poplar, Walnut, Apple, Asparagus, Rice, 
Maize, Millet, Onion, Barley, Orchard grass, Oat, Rye, and Wheat. 

Such cells are available from a wide range of sources including: the American 
Type Culture Collection (Rockland, MD); or from any of a number seed companies, 
for example, W. Atlee Burpee Seed Co. (Warminster, PA), Park Seed Co. 

2 5 (Greenwood, SC), Johnny Seed Co. (Albion, ME), or Northrup King Seeds 

(Harstville, SC). Descriptions and sources of useful host cells are also found in Vasil 
I.K., Cell Culture and Somatic Cell Genetics of Plants, Vol I, II, III Laboratory 
Procedures and Their Applications Academic Press, New York, 1984; Dixon, R.A., 



-28- 



WO 97/35965 



PCT/US97/04712 



Plant Cell Culture-A Practical Approach, IRL Press, Oxford University, 1985; Green 
et ah, Plant Tissue and Cell Culture, Academic Press, New York, 1987; and Gasser 
and Fraley, Science 244, 1293, (1989). 

For prokaryotic expression, DNA encoding an HXK polypeptide is carried on 
5 a vector operably linked to control signals capable of effecting expression in the 
prokaryotic host. If desired, the coding sequence may contain, at its 5' end, a 
sequence encoding any of the known signal sequences capable of effecting secretion 
of the expressed protein into the periplastic space of the host cell, thereby facilitating 
recovery of the protein and subsequent purification. Prokaryotes most frequently used 

10 are various strains of £. coli; however, other microbial strains may also be used. 
Plasmid vectors are used which contain replication origins, selectable markers, and 
control sequences derived from a species compatible with the microbial host. 
Examples of such vectors are found in Pouwcls et al. (supra) or Ausubel et al. 
(supra). Commonly used prokaryotic control sequences (also referred to as 

15 "regulatory elements") are defined herein to include promoters for transcription 

initiation, optionally with an operator, along with ribosome binding site sequences. 
Promoters commonly used to direct protein expression include the beta-lactamase 
(penicillinase), the lactose (lac) (Chang et al., Nature 198, 1056 (1977)), the 
tryptophan (Trp) (Goeddel et al., Nucl. Acids Res. 8, 4057 (1980)), and the tac 

2 0 promoter systems, as well as the lambda-derived P L promoter and N-gene ribosome 
binding site (Simatake et al., Nature 292, 128 (1981)). 

One particular bacterial expression system for HXK polypeptide production is 
the E. coli pET expression system (Novagen, Inc., Madison, WI). According to this 
expression system, DNA encoding an HXK polypeptide is inserted into a pET vector 

2 5 in an orientation designed to allow expression. Since the HXK gene is under the 
control of the T7 regulatory signals, expression of HXK is induced by inducing the 
expression of T7 RNA polymerase in the host cell. This is typically achieved using 
host strains which express T7 RNA polymerase in response to 1PTG induction. Once 
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produced, recombinant HXK polypeptide is then isolated according to standard 
methods known in the art, for example, those described herein. 

Another bacterial expression system for HXK polypeptide production is the 
pGEX expression system (Pharmacia). This system employs a GST gene fusion 
5 system which is designed for high-level expression of genes or gene fragments as 
fusion proteins with rapid purification and recovery of functional gene products. The 
protein of interest is fused to the carboxyl terminus of the glutathione S-transferase 
protein from Schistosoma japonicum and is readily purified from bacterial lysates by 
affinity chromatography using Glutathione Sepharose 4B. Fusion proteins can be 

1 0 recovered under mild conditions by elution with glutathione. Cleavage of the 

glutathione S-transferase domain from the fusion protein is facilitated by the presence 
of recognition sites for site-specific proteases upstream of this domain. For example, 
proteins expressed in pGEX-2T plasmids may be cleaved with thrombin; those 
expressed in pGEX-3X may be cleaved with factor Xa. 

15 For eukaryotic expression, the method of transformation or transfection and 

the choice of vehicle for expression of the HXK polypeptide will depend on the host 
system selected. Transformation and transfection methods are described, e.g., in 
Ausubel et al. (supra); Weissbach and Weissbach, Methods for Plant Molecular 
Biology, Academic Press, 1989; Gelvin et al, Plant Molecular Biology Manual, 

20 Kluwer Academic Publishers, 1990; Kindle, K., Proc. Natl Acad. ScL, U.S. A 87, 
1228 (1990); Potrykus, 1., Annu. Rev. Plant Physiol Plant Mol Biology 42, 205 
(1991); and BioRad (Hercules, CA) Technical Bulletin #1687 (Biolistic Particle 
Delivery Systems). Expression vehicles may be chosen from those provided, e.g., in 
Cloning Vectors: A Laboratory Manual (P.H. Pouwels et al., 1985, Supp. 1987); 

2 5 Gasser and Fraley ( supra ): Clontech Molecular Biology Catalog (Catalog 1 992/93 
Tools for the Molecular Biologist, Palo Alto, CA); and the references cited above. 

One preferred eukaryotic expression system is the mouse 3T3 fibroblast host 
cell transfected with a pMAMneo expression vector (Clontech). pMAMneo provides: 
an RSV-LTR enhancer linked to a dexamcthasone-inducible MMTV-LTR promotor, 



-30- 



WO 97/35965 



PC17US97/04712 



an SV40 origin of replication which allows replication in mammalian systems, a 
selectable neomycin gene, and SV40 splicing and polyadenylation sites. DNA 
encoding an HXK polypeptide is inserted into the pMAMneo vector in an orientation 
designed to allow expression. The recombinant HXK protein is then isolated as 
5 described below. Other preferable host cells which may be used in conjunction with 
the pMAMneo expression vehicle include COS cells and CHO cells (ATCC 
Accession Nos. CRL 1650 and CCL 61, respectively). 

Alternatively, if desired, an HXK polypeptide is produced by a stably- 
transfected mammalian cell line. A number of vectors suitable for stable transfection 

10 of mammalian cells are available to the public, e.g., see Pouwels et al. (supra); 

methods for constructing such cell lines are also publicly available, e.g., in Ausubel et 
al. (supra). In one example, cDNA encoding the HXK polypeptide is cloned into an 
expression vector which includes the dihydrofolate reductase (DHFR) gene. 
Integration of the plasmid and, therefore, the HXK-encoding gene into the host cell 

1 5 chromosome is selected for by inclusion of 0.0 1 -300 methotrexate in the cell 

culture medium (as described in Ausubel et al., supra). This dominant selection can 
be accomplished in most cell types. Recombinant protein expression can be increased 
by DHFR-mediated amplification of the transfected gene. Methods for selecting cell 
lines bearing gene amplifications are described in Ausubel et al. (supra); such 

2 0 methods generally involve extended culture in medium containing gradually 

increasing levels of methotrexate. DHFR-containing expression vectors commonly 
used for this purpose include pCVSEII-DHrF and pAdD26SV(A) (described in 
Ausubel et al., supra). Any of the host cells described above or, preferably, a DHFR- 
deficient CHO cell line (for example, CHO DHFR cells, ATCC Accession No. CRL 

2 5 9096) are among the host cells preferred for DHFR selection of a stably-transfected 
cell line or DHFR-mediated gene amplification. 

Most preferably, an HXK polypeptide is produced by a stably-transfected 
plant cell line or by a transgenic plant. A number of vectors suitable for stable 
transfection of plant cells or for the establishment of transgenic plants are available to 
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the public; such vectors are described in Pouwels et al. (supra), Weissbach and 
Weissbach (supra), and Gelvin et al. (supra). Methods for constructing such cell lines 
are described in, e.g., Weissbach and Weissbach (supra), and Gelvin et al. (supra). 
Typically, plant expression vectors include (1) a cloned plant gene under the 
5 transcriptional control of 5' and 3 ! regulatory sequences and (2) a dominant selectable 
marker. Such plant expression vectors may also contain, if desired, a promoter 
regulatory region (for example, one conferring inducible or constitutive, 
environmentally- or developmentally-regulated, or cell- or tissue-specific expression), 
a transcription initiation start site, a ribosome binding site, an RNA processing signal, 

10 a transcription termination site, and/or a polyadenylation signal. 

Alternatively, the HXK polypeptide may be produced using a transient 
expression system (e.g., the maize transient expression system described by Sheen 
Plant Cell 2, 1027(1990)). 

Once the desired HXK nucleic acid sequences is obtained it may be 

15 manipulated in a variety of ways known in the art. For example, where the sequence 
involves non-coding flanking regions, the flanking regions may be subjected to 
mutagenesis. 

The HXK DNA sequence of the invention may, if desired, be combined with 
other DNA sequences in a variety of ways. The HXK DNA sequence of the invention 
2 0 may be employed with all or part of the gene sequences normally associated with the 
HXK protein. In its component parts, a DNA sequence encoding an HXK protein is 
combined in a DNA construct having a transcription initiation control region capable 
of promoting transcription and translation in a host cell. 

In general, the constructs will involve regulatory regions functional in plants 
25 which provide for modified production of HXK protein as discussed herein. The open 
reading frame coding for the HXK protein or functional fragment thereof will be 
joined at its 5 1 end to a transcription initiation regulatory region such as the sequence 
naturally found in the 5' upstream region of the HXK structural gene. Numerous other 
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transcription initiation regions are available which provide for constitutive or 
inducible regulation. 

For applications when developmental, cell, tissue, hormonal, or environmental 
expression is desired, appropriate 5 1 upstream non-coding regions are obtained from 
5 other genes; for example, from genes regulated during seed development, embryo 
development, or leaf development. 

Regulatory transcript termination regions may also be provided in DNA 
constructs of this invention as well. Transcript termination regions may be provided 
by the DNA sequence encoding the HXK protein or any convenient transcription 

1 0 termination region derived from a different gene source. The transcript termination 
region will contain preferably at least 1-3 kb of sequence 3' to the structural gene from 
which the termination region is derived. Plant expression constructs having HXK as 
the DNA sequence of interest for expression (in either the sense or antisense 
orientation) thereof may be employed with a wide variety of plant life, particularly 

1 5 plant life involved in the production of storage reserves (for example, those involving 
carbon and nitrogen metabolism). Such genetically-engineered plants are useful for a 
variety of industrial and agricultural applications as discussed below. Importantly, 
this invention is applicable to dicotyledons and monocotyledons, and will be readily 
applicable to any new or improved transformation or regeneration method. 

20 An example of a useful plant promoter according to the invention is a 

caulimovirus promoter, for example, a cauliflower mosaic virus (CaMV) promoter. 
These promoters confer high levels of expression in most plant tissues, and the 
activity of these promoters is not dependent on virally encoded proteins. CaMV is a 
source for both the 35S and 19S promoters. In most tissues of transgenic plants, the 

25 CaMV 35S promoter is a strong promoter (see, e.g., Odell et al., Nature 313, 810 

(1985)). The CaMV promoter is also highly active in monocots (see, e.g., Dekeyser et 
al., Plant Cell 2, 591 (1990); Terada and Shimamoto, Mol. Gen. Genet. 220, 389, 
(1990)). Moreover, activity of this promoter can be further increased (i.e., between 2- 
10 fold) by duplication of the CaMV 35S promoter (see e.g., Kay et al., Science 236, 
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1299 (1987); Ow et al., Proc. Natl. Acad. Sci., U.S.A. 84, 4870 (1987); and Fang et 
\l, Plant Cell 1, 141 (1989)). 

Other useful plant promoters include, without limitation, the nopaline synthase 
promoter (An et al., Plant Physiol. 88, 547 (1988)) and the octopine synthase 
5 promoter (Fromm et al., Plant Cell 1, 977 (1989)). 

For certain applications, it may be desirable to produce the HXK gene product 
in an appropriate tissue, at an appropriate level, or at an appropriate developmental 
time. For this purpose, there are an assortment of gene promoters, each with its own 
distinct characteristics embodied in its regulatory sequences, shown to be regulated in 

10 response to the environment, hormones, and/or developmental cues. These include 
gene promoters that are responsible for heat-regulated gene expression (see, e.g., 
Callis et al., Plant Physiol. 88, 965 (1988); Takahashi and Komeda, Mol. Gen. Genet. 
219, 365 (1989); and Takahashi et al. Plant J. 2, 751 (1992)), light-regulated gene 
expression (e.g., the pea rbcS-3A described by Kuhlemeier et al., Plant Cell 1, 471 

15 _ (1989); the maize rbcS promoter described by Schaffher and Sheen, Plant Cell 3, 997 
(1991); or the cholorphyll a/b-binding protein gene found in pea described by 
Simpson et al., EM BO J. 4, 2723 (1985)), hormone-regulated gene expression (for 
example, the abscisic acid (ABA) responsive sequences from the Em gene of wheat 
described by Marcotte et al., Plant Cell 1, 969 (1989); the ABA-inducible HVA1 and 

2 0 H V A22, and rd29A promoters described for barley and Arabidopsis by Straub et al., 
Plant Cell 6, 617 (1994), Shen et al., Plant Cell 7, 295 (1994), and Yamaguchi- 
Shinosaki et al wound-induced gene expression (for example, ofwunl described by 
Siebertz et al., Plant Cell 1, 961 (1989)), or organ-specific gene expression (for 
example, of the tuber-specific storage protein gene described by Roshal et al., EMBO 

25 J. 6, 1 1 55 (1987); the 23-kDa zein gene from maize described by Schernthaner et al., 
EMBO J. 7, 1249 (1988); or the French bean 8-phaseolin gene described by Bustos ct 
A., Plant Cell 1,839 (1989)). 

Plant expression vectors may also optionally include RNA processing signals, 
e.g, introns, which have been shown to be important for efficient RNA synthesis and 
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accumulation (Callis et al., Genes and Dev. 1, 1 183 (1987)). The location of the RNA 
splice sequences can dramatically influence the level of transgene expression in 
plants. In view of this fact, an intron may be positioned upstream or downstream of 
an HXK polypeptide-encoding sequence in the transgene to modulate levels of gene 
expression. 

In addition to the aforementioned 5' regulatory control sequences, the 
expression vectors may also include regulatory control regions which are generally 
present in the 3' regions of plant genes (Thornburg et al., Proc. Nail. Acad. Sri. U.S.A. 
84, 744 (1987); An et al.. Plant Cell 1,115 (1989)). For example, the 3 1 terminator 
region may be included in the expression vector to increase stability of the mRNA. 
One such terminator region may be derived from the PI-II terminator region of potato. 
In addition, other commonly used terminators are derived from the octopine or 
nopaline synthase signals. 

The plant expression vector also typically contains a dominant selectable 
marker gene used to identify those cells that have become transformed. Useful 
selectable genes for plant systems include genes encoding antibiotic resistance genes, 
for example, those encoding resistance to hygromycin, kanamycin, bleomycin, G418, 
streptomycin or spectinomycin. Genes required for photosynthesis may also be used 
as selectable markers in photosynthetic-deficient strains. Alternatively, the green- 
fluorescent protein from the jellyfish Aequorea victoria may be used as a selectable 
marker (Sheen et al., Plant J. 8:777, 1995; Chiu et al., Current Biology 6, 325 
(1996)). Finally, genes encoding herbicide resistance may be used as selectable 
markers; useful herbicide resistance genes include the bar gene encoding the enzyme 
phosphinothricin acetyltransferase and conferring resistance to the broad spectmm 
herbicide Basta® (Hoechst AG, Frankfurt, Germany). 

Efficient use of selectable markers is facilitated by a determination of the 
susceptibility of a plant cell to a particular selectable agent and a determination of the 
concentration of this agent which effectively kills most, if not all, of the transformed 
cells. Some useful concentrations of antibiotics for tobacco transformation include, 
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e.g., 75-100 ng/ml (kanamycin), 20-50 [ig/ml (hygromycin), or 5-10 ng/ml 
(bleomycin). A useful strategy for selection of transformants for herbicide resistance 
is described, e.g., by Vasil et al., supra. 

It should be readily apparent to one skilled in the art of molecular biology, 
especially in the field of plant molecular biology, that the level of gene expression is 
dependent, not only on the combination of promoters, RNA processing signals, and 
terminator elements, but also on how these elements are used to increase the levels of 
selectable marker gene expression. 

plant Transformation 

Upon construction of the plant expression vector, several standard methods are 
available for introduction of the vector into a plant host, thereby generation a 
transgenic plant. These methods include (1) Agrobacterium-mcdiated transformation 
(A. tumefaciens or A. rhizogenes) (see, e.g., Lichtenstein and Fuller In: Genetic 
Engineering, vol 6, PWJ Rigby, ed, London, Academic Press, 1987; and Lichtenstein, 
CP., and Draper, J,. In: DNA Cloning, Vol II, D.M. Glover, ed, Oxford, IRI Press, 
1985)), (2) the particle delivery system (see, e.g., Gordon-Kamm et ah, Plant Cell 2, 
603 (1990); or BioRad Technical Bulletin 1687, supra), (3) microinjection protocols 
(see, e.g., Green et al., supra), (4) polyethylene glycol (PEG) procedures (see, e.g., 
Draper et al., Plant Cell Physiol. 23, 451 (1982); or e.g., Zhang and Wu, Theor. Appl. 
Genet. 76, 835 (1988)), (5) liposome-mediated DNA uptake (see, e.g., Freeman et al., 
Plant Cell Physiol. 25, 1353 (1984)), (6) electroporation protocols (see, e.g., Gelvin et 
al., supra\ Dekeyser et al., supra; Fromm et al., Nature 319, 791 (1986); Sheen Plant 
Cell 2, 1027 (1990);or Jang and Sheen Plant Cell 6, 1665 (1994)), and (7) the 
vortexing method (see, e.g., Kindle supra). The method of transformation is not 
critical to the instant invention. Any method which provides for efficient 
transformation may be employed. As newer methods are available to transform crops 
or other host cells, they may be directly applied. 
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The following is an example outlining one particular technique, an 
Agrobacterium-mediated plant transformation. By this technique, the general process 
for manipulating genes to be transferred into the genome of plant cells is carried out in 
two phases. First, cloning and DNA modification steps are carried out in E. coli, and 
5 the plasmid containing the gene construct of interest is transferred by conjugation or 
electroporation into Agrobacterium. Second, the resulting Agrobacterium strain is 
used to transform plant cells. Thus, for the generalized plant expression vector, the 
plasmid contains an origin of replication that allows it to replicate in Agrobacterium 
and a high copy number origin of replication functional in E. coll This permits facile 

1 0 production and testing of transgenes in E. coli prior to transfer to Agrobacterium for 
subsequent introduction into plants. Resistance genes can be carried on the vector, 
one for selection in bacteria, for example, streptomycin, and another that will function 
in plants, for example, a gene encoding kanamycin resistance or herbicide resistance. 
Also present on the vector are restriction endonuclease sites for the addition of one or 

1 5 more transgenes and directional T-DNA border sequences which, when recognized by 
the transfer functions of Agrobacterium, delimit the DNA region that will be 
transferred to the plant. 

In another example, plant cells may be transformed by shooting into the cell 
tungsten microprojectiles on which cloned DNA is precipitated. In the Biolistic 

2 0 Apparatus (Bio-Rad) used for the shooting, a gunpowder charge (22 caliber Power 
Piston Tool Charge) or an air-driven blast drives a plastic macroprojectile through a 
gun barrel. An aliquot of a suspension of tungsten particles on which DNA has been 
precipitated is placed on the front of the plastic macroprojectile. The latter is fired at 
an acrylic stopping plate that has a hole through it that is too small for the 

2 5 macroprojectile to pass through. As a result, the plastic macroprojectile smashes 
against the stopping plate, and the tungsten microprojectiles continue toward their 
target through the hole in the plate. For the instant invention the target can be any 
plant cell, tissue, seed, or embryo. The DNA introduced into the cell on the 
microprojectiles becomes integrated into either the nucleus or the chloroplast. 
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In general, transfer and expression of transgenes in plant cells are now routine 
practices to those skilled in the art, and have become major tools to carry out gene 
expression studies in plants and to produce improved plant varieties of agricultural or 
commercial interest. 

5 Transgenic P lant Regeneration 

Plants cells transformed with a plant expression vector can be regenerated, for 
example, from single cells, callus tissue, or leaf discs according to standard plant 
tissue culture techniques. It is well known in the art that various cells, tissues, and 
organs from almost any plant can be successfully cultured to regenerate an entire 
10 plant; such techniques are described, e.g., in Vasil supra; Green et al, supra; 
Weissbach and Weissbach, supra; and Gelvin et al., supra. 

In one particular example, a cloned HXK polypeptide or an antisense construct 
under the control of the 35S CaMV promoter and the nopaline synthase terminator 
and carrying a selectable marker (for example, kanamycin resistance) is transformed 
1 5 into Agrobacterium. Transformation of leaf discs (for example, of tobacco leaf discs), 
with vector-containing Agrobacterium is carried out as described by Horsch et al. 
(Science 227, 1229 (1985)). Putative transformants are selected after a few weeks (for 
example, 3 to 5 weeks) on plant tissue culture media containing kanamycin (e.g. 100 
Hg/ml). Kanamycin-resistant shoots are then placed on plant tissue culture media 
2 0 without hormones for root initiation. Kanamycin-resistant plants are then selected for 
greenhouse growth. If desired, seeds from self-fertilized transgenic plants can then be 
sowed in a soil-less medium and grown in a greenhouse. Kanamycin-resistant 
progeny are selected by sowing surfaced sterilized seeds on hormone-free kanamycin- 
containing media. Analysis for the integration of the transgene is accomplished by 
2 5 standard techniques (see, for example, Ausubel et al. supra; Gelvin et al. supra). 

Transgenic plants expressing the selectable marker are then screened for 
transmission of the transgene DNA by standard immunoblot and DNA detection 
techniques. Each positive transgenic plant and its transgenic progeny are unique in 
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comparison to other transgenic plants established with the same transgene. 
Integration of the transgene DNA into the plant genomic DNA is in most cases 
random, and the site of integration can profoundly effect the levels and the tissue and 
developmental patterns of transgene expression. Consequently, a number of 
5 transgenic lines are usually screened for each transgene to identify and select plants 
with the most appropriate expression profiles. 

Transgenic lines are evaluated on levels of transgene expression. Expression 
at the RNA level is determined initially to identify and quantitate expression-positive 
plants. Standard techniques for RNA analysis are employed and include PCR 

10 amplification assays using oligonucleotide primers designed to amplify only 

transgene RNA templates and solution hybridization assays using transgene-specific 
probes (see, e.g., Ausubel et al., supra). The RNA-positive plants are then analyzed 
for protein expression by Western immunoblot analysis using HXK specific 
antibodies (see, e.g., Ausubel et al., supra). In addition, in situ hybridization and 

15 immunocytochemistry according to standard protocols can be done using transgene- 
specific nucleotide probes and antibodies, respectively, to localize sites of expression 
within transgenic tissue. 

Once the recombinant HXK protein is expressed in any cell or in a transgenic 
plant (for example, as described above), it may be isolated, e.g., using affinity 

20 chromatography. In one example, an anti-HXK antibody (e.g., produced as described 
in Ausubel et al., supra, or by any standard technique) may be attached to a column 
and used to isolate the polypeptide. Lysis and fractionation of HXK-producing cells 
prior to affinity chromatography may be performed by standard methods (see, e.g., 
Ausubel et al., supra). Once isolated, the recombinant protein can, if desired, be 

2 5 further purified, for example, by high performance liquid chromatography (see, e.g., 
Fisher, Laboratory Techniques In Biochemistry? And Molecular Biology, eds., Work 
andBurdon, Elsevier, 1980). 

These general techniques of polypeptide expression and purification can also 
be used to produce and isolate useful HXK fragments or analogs. 
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Use 

The invention described herein is useful for a variety of agricultural and 
commercial purposes including, but not limited to, increasing crop yields, improving 
crop and ornamental quality, and reducing agricultural production costs. For 
5 example, the methods, DNA constructs, proteins, and transgenic plants described 
herein are useful for improving fruit and vegetable characteristics including: taste, 
texture, size, color, acidity or sweetness; nutritional content; disease resistance; and 
ripening processes. 

Our results presented above demonstrate that it is possible to modulate 

1 0 hexokinase gene expression in transgenic plants by providing for the transcription of a 
hexokinase sequence that is complementary to the mRNA of an endogenous plant 
hexokinase. In this manner, various plant processes can be modified, controlled, or 
manipulated, resulting in enhancement of production of carbohydrate (e.g., sucrose 
and starch) products, changes in plant growth, cellular differentiation and 

15 development, changes in plant phenotypes, and alteration of carbon/nitrogen 
partitioning and accumulation. In addition, as is discussed above, antisense 
expression can be controlled, if desired, in a cell-, tissue-, organ-, or developmentally- 
specific manner. Thus, the use of antisense control can provide for substantial 
inhibition or varying degrees of reduction of hexokinase gene expression. In this 

2 0 manner, cellular phenotypes can be modified without the production of extraneous 
proteins and with particular targeting to a specific gene. 

For example, transgenic plants expressing antisense hexokinase RNA 
constructs are useful for eliminating feedback inhibition of photosynthesis (for 
example, by sugar induced repression of photosynthetic genes) that is caused by the 

2 5 accumulation of sugar metabolites (for example, the photosynthetic endproducts 
sucrose and glucose). As shown herein, transgenic plants expressing antisense 
hexokinase genes are less sensitive to sugar, and are no longer subject to growth 
limitations and restrictions that arc the result of sugar repression (for example, 
reduction of photosynthetic gene expression). In particular, we have discovered that 
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transgenic plants expressing antisense hexokinase genes develop normally and thrive 
under conditions that typically limit and restrict plant growth due to feedback 
inhibition (for example, in wild-type plants shoot development is blocked by high 
hexose concentrations, but transgenic plants expressing antisense hexokinase shoot 
development proceeds normally). Thus transgenic plants expressing antisense 
hexokinase are useful for a variety of agricultural purposes including, but not limited 
to, the promotion of growth rate and development, seed germination, the stimulation 
of flowering, and improvement of crop yield, especially under adverse environmental 
conditions, for example, high light, high temperature, and high C0 2 . 

In addition, the results presented above demonstrate that it is possible to 
modulate a plants sensitivity to sugar by increasing levels of hexokinase protein. In 
particular, we found that increased levels of hexokinase protein are useful for 
promoting increased expression of a sugar-activated gene (for example, NR1). In this 
manner, various plant processes that are controlled, modulated, or activated by sugar 
can be regulated or manipulated by increasing the levels of hexokinase protein in a 
given plant cell, tissue, or organ. Such genetic engineering of gene expression is 
useful for enhancing storage protein accumulation and nitrogen accumulation, 
improving plant wounding responses and pathogen defense mechanisms, as well as 
for modifying pigmentation (for example, anthocyanin) of plant tissues (for example, 
fruits and flowers) for ornamental and horticultural purposes. For example, increased 
expression of hexokinase is useful for manipulating or promoting the expression of a 
wide variety of sugar-activated genes that encode an assortment of proteins including, 
but not limited to, potato storage protein patatin, soybean vegetative storage protein, 
sporamin, proteinase inhibitor II, sucrose phosphate synthase, rice and maize sucrose 
synthase, chalcone synthase, and nitrate reductase. 

All publications and patents mentioned in this specification are herein 
incorporated by reference to the same extend as if each individual publication or 
patent was specifically and individually indicated to be incorporated by reference. 
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Other Embodiments 
From the foregoing description, it will be apparent that variations and 
modifications may be made to the invention described herein to adopt it to various 
usages and conditions. Such embodiments are also within the scope of the following 
claims. 

What is claimed is: 
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(2) INFORMATION FOR SEQ ID NO: 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 : 

Met Gly Lys Val Ala Val Gly Ala Thr Val Val Cys Thr Ala Ala Val 
1 5 10 15 

Cys Ala Val Ala Val Leu Val Val Arg Arg Arg Met Gin Ser Ser Gly 
20 25 30 

Lys Tip Gly Arg Val Leu Ala He Leu Lys Ala Phe Glu Glu Asp Cys 
35 40 45 

Ala Thr Pro He Ser Lys Leu Arg Gin Val Ala Asp Ala Met Thr Val 
50 55 60 

Glu Met His Ala Gly Leu Ala Ser Asp Gly Gly Ser Lys Leu Lys Met 
65 70 75 80 

Leu He Ser Tyr Val Asp Asn Leu Pro Ser Gly Asp Glu Lys Gly Leu 
85 90 95 
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Phe Tyr Ala Leu Asp Leu Gly Gly Thr Asn Phe Arg Val Met Arg Val 
100 105 110 

Leu Leu Gly Gly Lys Gin Glu Arg Val Val Lys Gin Glu Phe Glu Glu 
115 120 125 

Val Ser He Pro Pro His Leu Met Thr Gly Gly Ser Asp Glu Leu Phe 
130 135 140 

Asn Phe He Ala Glu Ala Leu Ala Lys Phe Val Ala Thr Glu Cys Glu 
145 150 155 160 

Asp Phe His Leu Pro Glu Gly Arg Gin Arg Glu Leu Gly Phe Thr Phe 
165 170 175 

Ser Phe Pro Val Lys Gin Thr Ser Leu Ser Ser Gly Ser Leu He Lys 
180 185 190 

Trp Thr Lys Gly Phe Ser He Glu Glu Ala Val Gly Gin Asp Val Val 
195 200 205 

Gly Ala Leu Asn Lys Ala Leu Glu Arg Val Gly Leu Asp Met Arg He 
210 215 220 

Ala Ala Leu Val Asn Asp Thr Val Gly Thr Leu Ala Gly Gly Arg Tyr 
225 230 235 240 

Tyr Asn Pro Asp Val Val Ala Ala Val He Leu Gly Thr Gly Thr Asn 
245 250 255 
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Ala Ala Tyr Val Glu Arg Ala Thr Ala He Pro Lys Trp His Gly Leu 
260 265 270 

Leu Pro Lys Ser Gly Glu Met Val lie Asn Met Glu Trp Gly Asn Phe 
275 280 285 

Arg Ser Ser His Leu Pro Leu Thr Glu Phe Asp His Thr Leu Asp Phe 
290 295 300 

Glu Ser Leu Asn Pro Gly Glu Gin He Leu Glu Lys He lie Ser Gly 
305 310 315 320 

Met Tyr Leu Gly Glu He Leu Arg Arg Val Leu Leu Lys Met Ala Glu 
325 330 335 

Asp Ala Ala Phe Phe Gly Asp Thr Val Pro Ser Lys Leu Arg He Pro 
340 345 350 

Phe lie He Arg Thr Pro His Met Ser Ala Met His Asn Asp Thr Ser 
355 360 365 

Pro Asp Leu Lys He Val Gly Ser Lys lie Lys Asp He Leu Glu Val 
370 375 380 

Pro Thr Thr Ser Leu Lys Met Arg Lys Val Val lie Ser Leu Cys Asn 
385 390 395 400 

He He Ala Thr Arg Gly Ala Arg Leu Ser Ala Ala Gly He Tyr Gly 
405 410 415 



-46- 



WO 97/35965 



PCT/US97/04712 



He Leu Lys Lys Leu Gly Arg Asp Thr Thr Lys Asp Glu Glu Val Gin 
420 425 430 

Lys Ser Val lie Ala Met Asp Gly Gly Leu Phe Glu His Tyr Thr Gin 
435 440 445 

Phe Ser Glu Cys Met 
450 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Gly Lys Val Ala Val Ala Thr Thr Val Val Cys Ser Val Ala Val 
1 5 10 15 

Cys Ala Ala Ala Ala Leu He Val Arg Arg Arg Met Lys Ser Ala Gly 
20 25 30 

Lys Trp Ala Arg Val He Glu lie Leu Lys Ala Phe Glu Glu Asp Cys 
35 40 45 
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Ala Thr Pro lie Ala Lys Leu Arg Gin Val Ala Asp Ala Met Thr Val 
50 55 60 

Glu Met His Ala Gly Leu Ala Ser Glu Gly Gly Ser Lys Leu Lys Met 
65 70 75 80 

Leu He Ser Tyr Val Asp Asn Leu Pro Ser Gly Asp Glu Thr Gly Phe 
85 90 95 

Phe Tyr Ala Leu Asp Leu Gly Gly Thr Asn Phe Arg Val Met Arg Val 
100 105 110 

Leu Leu Gly Gly Lys His Asp Arg Val Val Lys Arg Glu Phe Lys Glu 
115 120 125 

Glu Ser He Pro Pro His Leu Met Thr Gly Lys Ser His Glu Leu Phe 
130 135 140 

Asp Phe He Val Asp Val Leu Ala Lys Phe Val Ala Thr Glu Gly Glu 
145 150 155 160 

Asp Phe His Leu Pro Pro Gly Arg Gin Arg Glu Leu Gly Phe Thr Phe 
165 170 175 

Ser Phe Pro Val Lys Gin Leu Ser Leu Ser Ser Gly Thr Leu He Asn 
180 185 190 

Tip Thr Lys Gly Phe Ser He Asp Asp Thr Val Asp Lys Asp Val Val 
195 200 205 
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Gly Glu Leu Val Lys Ala Met Glu Arg Val Gly Leu Asp Met Leu Val 
210 215 220 

Ala Ala Leu Val Asn Asp Thr He Gly Thr Leu Ala Gly Gly Arg Tyr 
225 230 235 240 

Thr Asn Pro Asp Val Val Val Ala Val lie Leu Gly Thr Gly Thr Asn 
245 250 255 

Ala Ala Tyr Val Glu Arg Ala His Ala He Pro Lys Tip His Gly Leu 
260 265 270 

Leu Pro Lys Ser Gly Glu Met Val He Asn Met Glu Trp Gly Asn Phe 
275 280 285 

Arg Ser Ser His Leu Pro Leu Thr Glu Tyr Asp His Ser Leu Asp Val 
290 295 300 

Asp Ser Leu Asn Pro Gly Glu Gin lie Leu Glu Lys He He Ser Gly 
305 310 315 320 

Met Tyr Leu Gly Glu He Leu Arg Arg Val Leu Leu Lys Met Ala Glu 
325 330 335 

Glu Ala Ala Phe Phe Gly Asp lie Val Pro Pro Lys Leu Lys He Pro 
340 345 350 

Phe He He Arg Thr Pro Asn Met Ser Ala Met His Ser Asp Thr Ser 
355 360 365 
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Pro Asp Leu Lys Val Val Gly Ser Lys Leu Lys Asp lie Leu Glu Val 
370 375 380 

Gin Thr Ser Ser Leu Lys Met Arg Lys Val Val He Ser Leu Cys Asn 
385 390 395 400 

He lie Ala Ser Arg Gly Ala Arg Leu Ser Ala Ala Gly He Tyr Gly 
405 410 415 

He Leu Lys Lys lie Gly Arg Asp Ala Thr Lys Asp Gly Glu Ala Gin 
420 425 430 

Lys Ser Val He Ala Met Asp Gly Gly Leu Phe Glu His Tyr Thr Gin 
435 440 445 

Phe Ser Glu Ser Met Lys Ser Ser Leu Lys Glu Leu Leu Gly Asp Glu 
450 455 460 

Val Ser Glu Ser Val Glu Val He Leu Ser Asn Asp Gly Ser Gly Val 
465 470 475 480 

Gly Ala Ala Leu Leu Ala Ala Ser His Ser Gin Tyr Leu Glu Leu Glu 
485 490 495 

Asp Asp Ser Glu Thr Ser 
500 

(2) INFORMATION FOR SEQ ID NO:3: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH : 2023 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

CAGTGTGAGT AATTTAGATC GGTATTAGAT CCATCTTAGG TTTCTCTAAT TTCTCTCAAT 60 

TCACTCCAAA ATTTTGATTA TTTCTTCTTT CTGGCTTGTC AATTTTAGTC ATTTGTAATC 120 

CTTGCTTTTG CG ATCGG A AT CGTAA AAATC CGATCTTTCT TTTAG ATTCG TTTTGTTTTT 1 80 

GATTCCAA AT CGGA A A AATG GGTAAAGTAG CTGTTGGAGC GACTGTTGTT TGCACGGCGG 240 

CGGTTTGTGC GGTGGCTGTT TTGGTTGTTC GACGACGGAT GCAGAGCTCA GGGAAGTGGG 300 

GACGTGTTTT GGCTATCCTC AAGGCCTTTG AAGAGGATTG TGCGACTCCG ATCTCGAAAC 360 

TGAGACAAGT GGCTGATGCT ATGACCGTTG AGATGCATGC TGGTCTTGCA TCCGACGGTG 420 

GTAGCA AACT CAAGATGCTT ATCAGCTACG TTGATAATCT TCCTTCCGGG GATGAAAAGG 480 

GTCTCTTTTA TGCATTGGAC CTAGGGGGGA CAAACTTCCG TGTCATGCGT GTGCTTCTTG 540 

GCGGGAAGCA AGAGCGTGTT GTTAAACAAG AATTCGAAGA AGTTTCGATT CCTCCTCATT 600 

TGATGACTGG TGGTTCAGAT GAGTTGTTCA ATTTTATAGC TGAAGCTCTT GCGAAGTTTG 660 

TCGCTACAGA ATGCGAAGAC TTTCATCTTC CAGAAGGTAG ACAGAGGGAA TTAGGTTTCA 720 

CTTTCTCGTT TCCTGTTAAG CAGACTTCTC TGTCCTCTGG TAGTCTCATC AAATGGACAA 780 

AAGGCTTTTC CATCGAAGAA GCAGTTGGAC AAGATGTTGT TGGAGCACTT AATAAGGCTC 840 
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TGGAAAGAGT TGGTCTTGAC ATGCGAATCG CAGCACTTGT TAATGATACC GTTGGAACAC 900 

TAGCCGGTGG TAGATACTAT AACCCGGATG TTGTTGCTGC TGTTATTTTA GGCACTGGGA 960 

CAAACGCAGC CTATGTTGAG CGTGCAACCG CGATCCCTAA ATGGCATGGT CTGCTTCCAA 1020 

AATCAGGAGA AATGGTTATA AACATGGAAT GGGGAAACTT CAGGTCATCA CATCTTCCAT 1080 

TAACCGAGTT TGATCACACG CTGGATTTCG AGAGTCTGAA TCCAGGCGAA CAGATTCTTG 1 140 

AGAA AATCAT TTCCGGTATG TACTTGGGAG AGATTTTGCG AAGAGTTCTT CTAAAGATGG 1200 

CTGAAGATGC TGCTTTCTTT GGCGATACAG TCCCATCTAA GCTGAGAATA CCATTCATCA 1260 

TTAGGACTCC TCACATGTCG GCTATGCACA ACGACACTTC TCCAGACTTG AAGATTGTTG 1320 

GGAGCAAGAT TAAGGATATA TTGGAGGTCC CTACAACTTC TCTGAAAATG AGAAAAGTTG 1380 

TGATCAGTCT CTGCAACATC ATAGCAACCC GAGGAGCTCG TCTCTCTGCT GCTGGAATCT 1440 
ATGGTATTCT GAAGAAACTG GGAAGAGATA CTACTAAAGA CGAGGAGGTG CAGAAATCGG 1500 

TTATAGCCAT GG ATGGTGG A TTGTTTGAGC ATTACACTCA GTTTAGTGAG TGTATGG AG A 1 560 

GCTCACTAAA AGAGTTGCTT GGAGATGAAG CTTCAGGAAG CGTTGAAGTC ACTCACTCCA 1620 .. 

ATGATGGATC AGGCATTGG A GCTGCGCTTC TTGCTGCTTC TCACTCTCTC TACCTTGAAG 1 680 

ACTCTTAAAA CCTACCCA A A GAGCGCCATT TTTCGGTAAT TTACTG AAAG CTTTTCGCTA 1 740 

TCAGAAAACG CCTAAGCCA A GTTCTAAGGC GTCATAAAAG AAAGCATTCC ATGTTTTTAC 1 800 

TCTTCCCCAA GACTTTCTTT GTAGCAAATA AGTTTCCTTG GGAG AAATAT TTGTTTTCAT 1 860 

GTTCTTCAA A A ATA AAAG AC TCAGTTCTTC AG ATTCTGGG ATTTTATTAT AACCAGATAT 1 920 

GTTGTAAAAA CTACAAATTC AAAGCTCACT TCACTGGAGT TCTGAGTATA TAAAGATTTC 1980 
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ATTTTTCCTA AAAAAAAAAA AAAAAACTAA ATTACTCACA CTC 

2023 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1 883 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

CAGTGTGAGT AATTTAGATC ATCTCTAGCG TTCTTAA AGT TTCCAACTTT TTTTTTTTAT 60 
TA ATTTGGGC CAACTTTTTT GTTTTATTAA TTTGGGCCAA CCTTTTTTGG TTTGAG AATT 1 20 
GGGCGAGGGA GAAAGATGGG TAAAGTGGCA GTTGCAACGA CGGTAGTGTG TTCGGTGGCG 180 
GTATGTGCGG CGGCGGCGTT GATAGTACGG AGGAGAATG A A AAGCGCAGG GAAATGGGCA 240 
AGAGTGATAG AGATATTGAA AGCCTTTGAA GAAGATTGTG CA ACGCCAAT TGCCA AATTG 300 
AGACAAGTGG CTGATGCTAT GACTGTTGAG ATGCATGCTG GTCTTGCTTC TGAAGGTGGC 360 
AGCAAGCTTA AGATGCTTAT TAGCTACGTT GATAATCTTC CTTCTGGGGA TGAGACTGGT 420 
TTTTTCTATG CGTTGGATCT AGGCGGAACA AACTTCCGTG TTATGCGTGT GCT TC T TG GT 480 
GGGAAGCACG ACCGTGTTGT TAAACG AGAA TTCAAAGA AG A ATCTATTCC TCCTCATTTG 540 
ATGACCGGGA AGTCACATGA ATTATTCG AT TTTATCGTTG ATGTTCTTGC CA AGTTTGTC 600 
GCTACAGA AG GCGAGGACTT TCATCTCCCA CCTGGTAGAC A ACGGGAACT AGGnTCACT 660 
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TTCTCATTTC CGGTTAAGCA GCTATCTTTA TCCTCTGGCA CTCTCATCAA CTGGACAAAG 720 

GGCTTTTCCA TTGACGATAC AGTTGATA AA GATGTTGTTG GAGAACTTGT TAAAGCTATG 780 

GAAAG AGTTG GGCTGGACAT GCTTGTCGCA GCGCTTGTTA ATGATACCAT TGG A ACACTT 840 

GCGGGTGGTA GATACACTAA CCCGGATGTC GTTGTCGCAG TTATTTTGGG CACCGGCACA 900 

AATGCAGCCT ATGTCGAACG TGCACATGCA ATTCCCAAAT GGCATGGTTT GCTACCCAAA 960 

TCAGG AG A A A TGGTG ATCA A CATGG AATGG GGA A ACTTCA GGTCATCACA TCTTCCATTG 1 020 

ACAGAGTACG ACCACTCTCT AGATGTCGAT AGTTTGAATC CTGGTGAACA GATTCTTGAG 1080 

AAAATCATTT CCGGAATGTA TCTGGGAGAA ATCTTGCGTA GAGTTCTTCT GAAGATGGCT 1 140 

GAAGA AGCTG CCTTCTTTGG CGATATCGTC CCACCTA AGC TGAAAATACC ATTCATCATA 1 200 

AGG ACCCCCA ACATGTCTGC TATGCACAGT GATACTTCCC CGG ATTTGAA GGTTGTAGGA 1 260 
AGCAAGTTAA A AG AC AT ATT GGAGGTCCAG ACTAGTTCTC TGAAGATGAG GAAAGTTGTG 1320 

ATCAGCCTAT GTA ACATCAT TGCAAGCCGA GGAGCTCGTT TATCTGCTGC GGGGATCTAT 1 380 
GGAATCCTCA AGAAAATAGG AAGAGACGCA ACAAAAGATG GAGAAGCTCA GAAATCTGTG 14fW 

ATAGCG ATGG ACGGTGGGCT ATTCGAGCAT TACACTCAGT TCAGTGAGTC G ATGAAGAGT 1 500 
TCATTG A AAG AGTTGCTTGG AG ATGAAGTT TCAGAG AGTG TTGAAGTGAT ACTGTCGA AT 1 560 
GATGGTTCAG GTGTTGG AGC TGCATTACTT GCTGCTTCTC ACTCTCAGTA TCTCGAACTT 1 620 
GAAGATG ACT CTG AAACA AG TTAATTTA A A GCTTTTTTGT GTTTAACCTT CTTCTTGTTG 1 680 
CGTAGGTTA A CAATAAAAGT AGAGGTA A AT GCCTTTGGGA A ATTTTATTT TTGACAATTT 1 740 
TCAGG AACAA TA AAACCTGG ATTCTTCATC AAAGCTCTGG GAA ATTCAAA CGACCAGCCA 1 800 

ATGTTGTAGA ACTATACATA TATATTCGAG TTCTTTCTAT GAA AAAAAAA AAAAAAAAAA I860 
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(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

Met Leu Asp Asp Arg Ala Arg Met Glu Ala Ala Lys Lys Glu Lys Val 
1 5 10 is 

Glu Gin He Leu Ala Glu Phe Gin Leu Gin Glu Glu Asp Leu Lys Lys 
20 25 30 

Val Met Arg Arg Met Gin Lys Glu Met Asp Arg Gly Leu Arg Leu Glu 
35 40 45 

Thr His Glu Glu Ala Ser Val Lys Met Leu Pro Thr Tyr Val Arg Ser 
50 55 60 

Thr Pro Glu Gly Ser Glu Val Gly Asp Phe Leu Ser Leu Asp Leu Gly 
65 70 75 80 
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Gly Thr Asn Phe Arg Val Met Leu Val Lys Val Gly Glu Gly Glu Glu 
85 90 95 

Gly Gin Trp Ser Val Lys Thr Lys His Gin Met Tyr Ser He Pro Glu 
100 105 110 



Asp Ala Met Thr Gly Thr Ala Glu Met Leu Phe Asp Tyr He Ser Glu 
115 120 125 



Cys He Ser Asp Phe Leu Asp Lys His Gin Met Lys His Lys Lys Leu 
130 135 140 

Pro Leu Gly Phe Thr Phe Ser Phe Pro Val Arg His Glu Asp lie Asp 
145 150 155 160 



Lys Gly He Leu Leu Asn Gin Thr Lys Gly Phe Lys Ala Ser Gly Ala 
165 170 175 

Glu Gly Asn Asn Val Val Gly Leu Leu Arg Asp Ala He Lys Arg Arg 
180 185 190 

Gly Asp Phe Glu Met Asp Val Val Ala Met Val Asn Asp Thr Val Ala 
195 200 205 

Thr Met He Ser Cys Tyr Tyr Glu Asp His Gin Cys Glu Val Gly Met 
210 215 220 225 

He Val Gly Thr Gly Cys Asn Ala Cys Tyr Met Glu Glu Met Gin Asn 
230 235 240 
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Val Glu Leu Val Glu Gly Asp Glu Gly Arg Met Cys Val Asn Thr Glu 
245 250 255 

Gin Gly Ala Phe Gly Asp Ser Gly Glu Leu Asp Glu Phe Leu Leu Glu 
260 265 270 

Tyr Asp Arg Met Val Asp Glu Ser Ser Ala Asn Pro Gly Gin Gin Leu 
275 280 285 

Tyr Glu Lys Leu He Gly Gly Lys Thr Met Gly Glu Leu Val Arg Leu 
290 295 300 305 

Val Leu Leu Arg Leu Val Asp Glu Asn Leu Leu Phe His Gly Glu Ala 
310 315 320 

Ser Glu Gin Leu Arg Thr Arg Gly Ala Phe Glu Thr Arg Phe Val Ser 
325 330 335 

Gin Val Glu Ser Asp Thr Gly Asp Arg Lys Gin He Tyr Asn He Leu 
340 345 350 

Ser Thr Leu Gly Leu Arg Pro Ser Thr Thr Asp Cys Asp He Val Arg 
355 360 365 

Arg Ala Cys Glu Ser Val Ser Thr Arg Ala Ala His Met Cys Ser Ala 
370 375 380 385 

Gly Leu Ala Gly Val He Asn Arg Met Arg Glu Ser Arg Ser Glu Asp 
390 395 400 
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Val Met Arg lie Thr Val Gly Val Asp Gly Ser Val Tyr Lys Leu His 
405 410 415 

Pro Ser Phe Lys Glu Arg Phe His Ala Ser Val Arg Arg Leu Thr Pro 
420 425 430 

Ser Cys Glu He Thr Phe He Glu Ser Glu Glu Gly Ser Gly Arg Gly 
435 440 445 

Ala Ala Leu Val Ser Ala Val Ala Cys Lys Lys Ala Cys Met Leu Gly 
450 455 460 465 

Gin 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Met Ala Met Asp Thr Thr Arg Cys Gly Ala Gin Leu Leu Thr Leu Val 
1 5 10 15 

Glu Gin He Leu Ala Glu Phe Gin Leu Gin Glu Glu Asp Leu Lys Lys 
20 25 30 
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Val Met Ser Arg Met Gin Lys Glu Met Asp Arg Gly Leu Arg Leu Glu 
35 40 45 

Thr His Glu Glu Ala Ser Val Lys Met Leu Pro Thr Tyr Val Arg Ser 
50 55 60 

Thr Pro Glu Gly Ser Glu Val Gly Asp Phe Leu Ser Leu Asp Leu Gly 
65 70 75 80 

Gly Thr Asn Phe Arg Val Met Leu Val Lys Val Gly Glu Gly Glu Ala 
85 90 95 

Gly Gin Trp Ser Val Lys Thr Lys His Gin Met Tyr Ser He Pro Glu 
100 105 HO 

Asp Ala Met Thr Gly Thr Ala Glu Met Leu Phe Asp Tyr He Ser Glu 
115 120 125 

Cys He Ser Asp Phe Leu Asp Lys His Gin Met Lys His Lys Lys Leu 
130 135 140 

Pro Leu Gly Phe Thr Phe Ser Phe Pro Val Arg His Glu Asp Leu Asp 
145 150 155 160 

Lys Gly He Leu Leu Asn Trp Thr Lys Gly Phe Lys Ala Ser Gly Ala 
165 170 175 

Glu Gly Asn Asn He Val Gly Leu Leu Arg Asp Ala lie Lys Arg Arg 
180 185 190 
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Gly Asp Phe Glu Met Asp Val Val Ala Met Val Asn Asp Thr Val Ala 
195 200 205 

Thr Met He Ser Cys Tyr Tyr Glu Asp Arg Gin Cys Glu Val Gly Met 
210 215 220 

He Val Gly Thr Gly Cys Asn Ala Cys Tyr Met Glu Glu Met Gin Asn 
225 230 235 240 

Val Glu Leu Val Glu Gly Asp Glu Gly Arg Met Cys Val Asn Thr Glu 
245 250 255 

Tip Gly Ala Phe Gly Asp Ser Gly Glu Leu Asp Glu Phe Leu Leu Glu 
260 265 270 

Tyr Asp Arg Met Val Asp Glu Ser Ser Ala Asn Pro Gly Gin Gin Leu 
275 280 285 

Tyr Glu Lys He He Gly Gly Lys Tyr Met Gly Glu Leu Val Arg Leu 
290 295 300 

Val Leu Leu Lys Leu Val Asp Glu Asn Leu Leu Phe His Gly Glu Ala 
305 310 315 320 

Ser Glu Gin Leu Arg Thr Arg Gly Ala Phe Glu Thr Arg Phe Val Ser 
325 330 335 

Gin Val Glu Ser Asp Ser Gly Asp Arg Lys Gin He His Asn He Leu 
340 345 350 
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Ser Thr Leu Gly Leu Arg Pro Ser Val Thr Asp Cys Asp He Val Arg 
355 360 365 

Arg Ala Cys Glu Ser Val Ser Thr Arg Ala Ala His Met Cys Ser Ala 
370 375 380 

Gly Leu Ala Gly Val He Asn Arg Met Arg Glu Ser Arg Ser Glu Asp 
385 390 395 400 

Val Met Arg He Thr Val Gly Val Asp Gly Ser Val Tyr Lys Leu His 
405 410 415 

Pro Ser Phe Lys Glu Arg Phe His Ala Ser Val Arg Arg Leu Thr Pro 
420 425 430 

Asn Cys Glu He Thr Phe He Glu Ser Glu Glu Gly Ser Gly Arg Gly 
435 440 445 

Ala Ala Leu Val Ser Ala Val Ala Cys Lys Lys Ala Cys Met Leu Ala 
450 455 460 

Gin 

465 

(2) INFORMATION FOR SEQ ID NO:7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Met Val His Leu Gly Pro Lys Lys Pro Gin Ala Arg Lys Gly Ser Met 
1 5 10 15 

Ala Asp Val Pro Lys Glu Leu Met Asp Glu He His Gin Leu Glu Asp 
20 25 30 

Met Phe Thr Val Asp Ser Glu Thr Leu Arg Lys Val Val Lys His Phe 
35 40 45 

He Asp Glu Leu Asn Lys Gly Leu Thr Lys Lys Gly Val Asn He Pro 
50 55 60 

Met He Pro Gly Trp Val Met Glu Phe Pro Thr Gly Lys Glu Ser Gly 
65 70 75 80 

Asn Tyr Leu Ala He Asp Leu Gly Gly Thr Asn Leu Arg Val Val Leu 
85 90 95 

Val Lys Leu Ser Gly Asn Arg Thr Phe Asp Thr Thr Gin Ser Lys Tyr 
100 105 110 

Lys Leu Pro His Asp Met Arg Thr Thr Lys His Gin Glu Glu Leu Trp 
115 120 125 
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Ser Phe lie Ala Asp Ser Leu Lys Asp Phe Met Val Glu Gin Glu Leu 
130 135 140 

Leu Asn Thr Lys Asp Thr Leu Pro Leu Gly Phe Thr Phe Ser Tyr Pro 
145 150 155 160 

Ala Ser Gin Asn Lys He Asn Glu Gly He Leu Gin Arg Tip Thr Lys 
165 170 175 
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Gly Phe Asp He Pro Asn Val Glu Gly His Asp Val Val Pro Leu Leu 
180 185 190 

Gin Lys Glu He Ser Lys Arg Glu Leu Pro He Glu lie Val Ala Leu 
195 200 205 

He Asn Asp Thr Val Gly Thr Leu He Ala Ser Tyr Tyr Thr Asp Pro 
210 215 220 

Glu Thr Lys Met Gly Val He Phe Gly Thr Gly Val Asn Gly Ala Phe 
225 230 235 240 

Tyr Asp Val Cys Ser Asp lie Glu Lys Leu Glu Gly Lys Leu Ala Asp 
245 250 255 

Asp He Pro Ser Asn Ser Pro Met Ala He Asn Cys Glu Tyr Gly Ser 
260 265 270 

Phe Asp Asn Glu His Leu Val Leu Pro Arg Thr Lys Tyr Asp Val Ala 
275 280 285 

Val Asp Glu Gin Ser Pro Arg Pro Gly Gin Gin Ala Phe Glu Lys Met 
290 295 300 

Thr Ser Gly Tyr Tyr Leu Gly Glu Leu Leu Arg Leu Val Leu Leu Glu 
305 310 315 320 

Leu Asn Glu Lys Gly Leu Met Leu Lys Asp Gin Asp Leu Ser Lys Leu 
325 330 335 
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Lys Gin Pro Tyr He Met Asp Thr Ser Tyr Pro Ala Arg He Glu Asp 
340 345 350 

Asp Pro Phe Glu Asn Leu Glu Asp Thr Asp Asp Met Phe Gin Lys Asp 
355 360 365 

Phe Gly Val Lys Thr Thr Leu Pro Glu Arg Lys Leu He Arg Arg Leu 
370 375 380 

Cys Glu Leu He Gly Thr Arg Ala Ala Arg Leu Ala Val Cys Gly He 
385 390 395 400 

Ala Ala He Cys Gin Lys Arg Gly Tyr Lys Thr Gly His He Ala Ala 
405 410 415 



Asp Gly Ser Val Tyr Asn Lys Tyr Pro Gly Phe Lys Glu Ala Ala Ala 
420 425 430 

Lys Gly Leu Arg Asp He Tyr Gly Trp Thr Gly Glu Asn Ala Ser Lys 
435 440 445 

Asp Pro He Thr He Val Pro Ala Glu Asp Gly Ser Gly Ala Gly Ala 
450 455 460 

Ala Val He Ala Ala Leu Ser Glu Lys Arg He Ala Glu Gly Lys Val 
465 470 475 480 



Ser Gly He He Gly Ala 
485 
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(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Met Val His Leu Gly Pro Lys Lys Pro Gin Ala Arg Lys Gly Ser Met 
15 10 15 

Ala Asp Val Pro Lys Glu Leu Met Gin Gin He Glu Asn Phe Glu Lys 
20 25 30 

He Phe Thr Val Pro Thr Glu Thr Leu Gin Ala Val Thr Lys His Phe 
35 40 45 

He Ser Glu Leu Glu Lys Gly Leu Ser Lys Lys Gly Gly Asn He Pro 
50 55 60 

Met He Pro Gly Trp Val Met Asp Phe Pro Thr Gly Lys Glu Ser Gly 
65 70 75 80 

Asn Tyr Leu Ala He Asp Leu Gly Gly Thr Asn Leu Arg Val Val Leu 

85 90 95 

Val Lys Leu Gly Gly Asp Arg Thr Phe Asp Thr Thr Gin Ser Lys Tyr 
100 105 110 
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Arg Leu Pro Asp Ala Met Arg Thr Thr Gin Asn Pro Asp Glu Leu Trp 
115 120 125 

Glu Phe He Ala Asp Ser Leu Lys Ala Phe lie Asp Glu Gin Phe Pro 
130 135 140 

Gin Gly He Ser Glu Pro He Pro Leu Gly Phe Thr Phe Ser Tyr Pro 
145 150 155 160 

Ala Ser Gin Asn Lys He Asn Glu Gly He Leu Gin Arg Trp Thr Lys 
165 170 175 

Gly Phe Asp He Pro Asn Val Glu Gly His Asp Val Val Pro Leu Leu 
180 185 190 

Gin Lys Glu He Ser Lys Arg Glu Leu Pro He Glu Cys Cys Ala Leu 
195 200 205 

He Asn Asp Thr Thr Gly Thr Leu Val Ala Ser Tyr Tyr Thr Asp Pro 
210 215 220 

Glu Thr Lys Met Gly Val lie Phe Gly Thr Gly Val Asn Gly Ala Tyr 
22 5 230 235 240 

Tyr Asp Val Cys Ser Asp lie Glu Lys Leu Trp Gly Lys Leu Ser Asp 
245 250 255 
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Asp He Pro Pro Ser Ala Pro Met Ala He Asn Cys Glu Tyr Gly Ser 
260 265 270 

Phe Asp Asn Glu His Val Val Leu Pro Arg Thr Lys Tyr Asp He Thr 
275 280 285 

He Asp Glu Glu Ser Pro Arg Pro Gly Trp Trp Thr Phe Glu Lys Met 
290 295 300 

Ser Ser Gly Tyr Tyr Leu Gly Glu lie Leu Arg Leu Ala Leu Met Asp 
305 310 315 320 

Met Tyr Lys Gin Gly Phe He Phe Lys Asn Gin Asp Leu Ser Lys Phe 
325 330 335 

Asp Lys Pro Phe Val Met Asp Thr Ser Tyr Pro Ala Arg He Glu Glu 
340 345 350 

Asp Pro Phe Glu Asn Leu Glu Asp Thr Asp Asp Leu Phe Gin Asn Glu 
355 360 365 

Phe Gly He Asn Thr Thr Val Gin Glu Arg Lys Leu He Arg Arg Leu 
370 375 380 

Ser Glu Leu He Gly Ala Arg Ala Ala Arg Leu Ser Val Cys Gly He 
385 390 395 400 

Ala Ala He Cys Gin Lys Arg Gly Tyr Lys Thr Gly His He Ala Ala 
405 410 415 



-68- 



WO 97/35965 



PCIYUS97/04712 



Asp Gly Ser Val Tyr Asn Arg Tyr Pro Gly Phe Lys Glu Lys Ala Ala 
420 425 430 

Asn Ala Leu Lys Asp He Tyr Gly Trp Thr Gin Thr Ser Leu Asp Asp 
435 440 445 

Tyr Pro He Lys He Val Pro Ala Glu Asp Gly Ser Gly Ala Gly Ala 
450 455 460 

Ala Val He Ala Ala Leu Ala Gin Lys Arg He Ala Glu Gly Lys Ser 
465 470 475 480 

Val Gly He He Gly Ala 
485 

(2) INFORMATION FOR SEQ ID NO:9: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 485 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Met Val Arg Leu Gly Pro Lys Lys Pro Pro Ala Arg Lys Gly Ser Met 
1 5 10 15 
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Ala Asp Val Pro Ala Asn Leu Met Glu Gin He His Gly Leu Glu Thr 
20 25 30 

Leu Phe Thr Val Ser Ser Glu Lys Met Arg Ser He Val Lys His Phe 
35 40 45 

He Ser Glu Leu Asp Lys Gly Leu Ser Lys Lys Gly Gly Asn He Pro 
50 55 60 

Met He Pro Gly Trp Val Val Glu Tyr Pro Thr Gly Lys Glu Thr Gly 
65 70 75 80 

Asp Phe Leu Ala Leu Asp Leu Gly Gly Thr Asn Leu Arg Val Val Leu 
85 90 95 

Val Lys Leu Gly Gly Asn His Asp Phe Asp Tyr Tyr Gin Asn Lys Tyr 
100 105 110 

Arg Leu Pro Asp His Leu Arg Thr Gly Thr Ser Glu Gin Leu Trp Ser 
115 120 125 

Phe He Ala Lys Cys Leu Lys Glu Phe Val Asp Glu Trp Tyr Pro Asp 
130 135 140 

Gly Val Ser Glu Pro Leu Pro Leu Gly Phe Thr Phe Ser Tyr Pro Ala 
145 150 155 160 

Ser Gin Lys Lys lie Asn Ser Gly Val Leu Gin Arg Trp Thr Lys Gly 
165 170 175 
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Phe Asp He Glu Gly Val Glu Gly His Asp Val Val Pro Met Leu Gin 
180 185 190 

Glu Gin He Glu Lys Leu Asn He Pro He Asn Val Val Arg Leu lie 
195 200 205 

Asn Asp Thr Thr Gly Thr Leu Val Ala Ser Leu Tyr Thr Asp Pro Gin 
210 215 220 



Thr Lys Met Gly He He He Gly Thr Gly Val Asn Gly Ala Tyr Tyr 
225 230 235 240 

Asp Val Val Ser Gly He Glu Lys Leu Glu Gly Leu Leu Pro Glu Asp 
245 250 255 

He Gly Pro Asp Ser Pro Met Ala lie Asn Cys Glu Tyr Gly Ser Phe 
260 265 270 

Asp Asn Glu Gly Leu Val Leu Pro Arg Thr Lys Tyr Asp Val He He 
275 280 285 

Asp Glu Glu Ser Pro Arg Pro Gly Gin Gin Ala Phe Glu Lys Met Thr 
290 295 300 

Ser Gly Tyr Tyr Leu Gly Glu lie Met Arg Leu Val Leu Leu Asp Leu 
305 310 315 320 
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Tyr Asp Ser Gly Phe He Phe Lys Asp Gin Asp lie Ser Lys Leu Lys 
325 330 335 

Glu Ala Tyr Val Met Asp Thr Ser Tyr Pro Ser Lys lie Glu Asp Asp 
340 345 350 

Pro Phe Glu Asn Leu Glu Asp Thr Asp Asp Leu Phe Lys Thr Asn Leu 
355 360 365 

Asn He Glu Thr Thr Val Val Glu Arg Lys Leu He Arg Lys Leu Ala 
370 375 380 

Glu Leu Val Gly Thr Arg Ala Ala Arg Leu Thr Val Cys Gly Val Ser 
385 390 395 400 

Ala lie Cys Asp Lys Arg Gly Tyr Lys Thr Ala His He Ala Ala Asp 
405 410 415 

Gly Ser Val Phe Asn Arg Tyr Pro Gly Tyr Lys Glu Lys Ala Ala Gin 
420 425 430 

Ala Leu Lys Asp He Tyr Asn Tip Asp Val Glu Lys Met Glu Asp His 
435 440 445 

Pro lie Gin Leu Val Ala Ala Glu Asp Gly Ser Gly Val Gly Ala Ala 
450 455 460 

He He Ala Cys Leu Thr Trp Lys Arg Leu Ala Ala Gly Lys Ser Val 
465 470 475 480 
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Gly He Lys Gly Glu 
485 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
ATGGGTAAAG TAGCTGTT 
(2) INFORMATION FOR SEQ ID NO:l 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l 1 : 

ATGGGTAAAG TGGCAGTTGC AA 
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(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12. 

TTAAGAGTCT TCAAGGTAGA G 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

TTAACTTGTT TCAGAGTCAT CTTC 
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Claims 

1 . A method for reducing the level of a plant hexokinase protein in a 
transgenic plant cell, said method comprising expressing in said transgenic plant cell 
an antisense hexokinase nucleic acid sequence. 

2: The method according to claim 1 , wherein said plant cell is from a 
monocot. 

3. The method according to claim 1, wherein said plant cell is from a dicot. 

4. The method according to claim 1 , wherein said plant cell is from a 
gymnosperm. 

5. The method according to claim 1, wherein said hexokinase nucleic acid 
sequence is encoded by a transgene integrated into the genome of said transgenic 
plant cell. 

6. The method according to claim 1 , further comprising growing a transgenic 
plant from said transgenic plant cell, whereby the level of said hexokinase protein is 
reduced in said transgenic plant. 

7. The method of claim 1, wherein said antisense hexokinase nucleic acid 
sequence is based on the AtHXKl nucleotide sequence that is shown Fig. IF (SEQ ID 
NO: 3). 

8. The method of claim 1, wherein said antisense hexokinase nucleic acid 
sequence is based on the AtHXK2 nucleotide sequence that is shown Fig. 1G (SEQ 
ID NO: 4). 
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9. The method according to claim 6, wherein said transgenic plant is less 
sensitive to sugar. 

10. A plant cell expressing an antisensc hexokinase nucleic acid sequence. 

1 1 . A plant cell according to claim 10, which is from a monocot. 

12. A plant cell according to claim 10, which is from a dicot. 

13. A plant cell according to claim 10, which is from a gymnosperm. 

14. An plant expression vector comprising an antisense hexokinase nucleic 
acid sequence, wherein said sequence is operably linked to an expression control 
region. 

15. A substantially pure plant HXK polypeptide comprising an amino acid 
sequence substantially identical to the amino acid sequence of AtHXKl (SEQ ID NO: 

1). 

16. The polypeptide of claim 1 5, wherein said polypeptide comprises the 
amino acid sequence shown in Fig. IB (SEQ ID NO: 1). 

1 7. The polypeptide of claim 1 5, wherein said polypeptide is from a monocot. 

1 8. The polypeptide of claim 15, wherein said polypeptide is from a dicot. 

1 9. The polypeptide of claim 1 8, wherein said dicot is a member of the 
Solanaceae. 
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20. The polypeptide of claim 1 8, wherein said dicot is a member of the 
Cruciferae. 

21. The polypeptide of claim 20, wherein said cruciferous plant is 
Arabidopsis. 

5 22. The polypeptide of claim 15, wherein said polypeptide is from a 

gymnosperm. 

23. A substantially pure plant HXK polypeptide comprising an amino acid 
sequence substantially identical to the amino acid sequence of AtHXK2 (SEQ ID NO: 
2 >- 

10 24. The polypeptide of claim 23, wherein said polypeptide comprises the 

amino acid sequence shown in Fig. IB (SEQ ID NO: 2). 

25. The polypeptide of claim 23, wherein said polypeptide is from a monocot. 

26. The polypeptide of claim 23, wherein said polypeptide is from a dicot. 

27. The polypeptide of claim 26, wherein said dicot is a member of the 
15 Solanaceae. 

28. The polypeptide of claim 26, wherein said dicot is a member of the 
Cruciferae. 

29. The polypeptide of claim 28, wherein said cruciferous plant is 
Arabidopsis. 
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30. The polypeptide of claim 23, wherein said polypeptide is from a 
gymnosperm. 

3 1 . Substantially pure DNA encoding a plant HXK polypeptide comprising an 
amino acid sequence substantially identical to the amino acid sequence of an AtHXKl 
(SEQIDNO: 1). 

32. The DNA of claim 31, wherein said DNA comprises the nucleotide 
sequence shown in Fig. IF (SEQ ID NO: 3). 

33. The DNA of claim 31, wherein said DNA is from a monocot. 

34. The DNA of claim 31, wherein said DNA is from a dicot. 

35. The DNA of claim 34, wherein said dicot is a member of the Solanaceae. 

36. The DNA of claim 34, wherein said dicot is a member of the Cruciferae. 

37. The DNA of claim 36, wherein said cruciferous plant is Arabidopsis, 

38. The DNA of claim 3 1 , wherein said DNA is from a gymnosperm. 

39. Substantially pure DNA encoding a plant HXK polypeptide comprising an 
amino acid sequence substantially identical to the amino acid sequence of an AtHXK2 
(SEQ ID NO: 2). 

40. The DNA of claim 39, wherein said DNA comprises the nucleotide 
sequence shown in Fig. IF (SEQ ID NO: 4). 
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41 . The DNA of claim 39, wherein said DNA is from a monocot. 

42. The DNA of claim 39, wherein said DNA is from a dicot. 

43. The DNA of claim 42, wherein said dicot is a member of the Solanaceae. 

44. The DNA of claim 42, wherein said dicot is a member of the Cruci ferae. 

45. The DNA of claim 44, wherein said cruciferous plant is Arahidopsis. 

46. The DNA of claim 39, wherein said DNA is from a gymnosperm. 

47. The DNA of claim 3 1 , wherein said DNA is operably linked to a 
constitutive or regulated promoter. 

48. The DNA of claim 39, wherein said DNA is operably linked to a 
constitutive or regulated promoter. 

49. A vector comprising the substantially pure DNA of claim 31, said vector 
being capable of directing expression of the protein encoded by said DNA in a vector- 
containing cell. 

50. A cell which comprises the DNA of claim 3 1 . 

5 1 . The cell of claim 50, said cell being a plant cell. 

52. The cell of claim 50, said plant cell being hypersensitive to sugar. 

53. The cell of claim 50, said plant cell being less sensitive to sugar. 
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54. A vector comprising the substantially pure DNA of claim 39, said vector 
being capable of directing expression of the protein encoded by said DNA in a vector- 
containing cell. 

55. A cell which comprises the DNA of claim 39. 

5 56. The cell of claim 55, said cell being a plant cell. 

57. The cell of claim 55, said plant cell being hypersensitive to sugar. 

58. The cell of claim 55, said plant cell being less sensitive to sugar. 

59. A transgenic plant comprising the DNA of claim 31 integrated into the 
genome of said plant, wherein said DNA is expressed in said transgenic plant. 

10 60. The plant of claim 59, wherein said DNA is expressed under the control of 

a constitutive promoter. 

61. The plant of claim 59, wherein said DNA is expressed under the control of 
a regulated promoter. 

62. A seed from a transgenic plant of claim 59. 

15 63. A transgenic plant comprising the DNA of claim 39 integrated into the 

genome of said plant, wherein said DNA is expressed in said transgenic plant. 

64. The plant of claim 63, wherein said DNA is expressed under the control of 
a constitutive promoter. 
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65. The plant of claim 63, wherein said DNA is expressed under the control of 
a regulated promoter. 

66. A seed from a transgenic plant of claim 63. 

67. A method of producing a plant HXK polypeptide comprising: 

(a) providing a cell transformed with a gene encoding a polypeptide 
comprising an amino acid sequence substantially identical to the amino acid sequence 
of AtHXKl (SEQ ID NO: 1) positioned for expression in said cell; 

(b) expressing said plant HXK polypeptide; and 

(c) recovering said plant HXK polypeptide. 

68. A method of producing a plant HXK polypeptide comprising: 

(a) providing a cell transformed with a gene encoding a polypeptide 
comprising an amino acid sequence substantially identical to the amino acid sequence 
of AtHXK2 (SEQ ID NO: 2) positioned for expression in said cell; 

(b) expressing said plant HXK polypeptide; and 

(c) recovering said plant HXK polypeptide. 

69. A method for increasing the level of a hexokinase protein in a transgenic 
plant cell, said method comprising expressing in said transgenic plant cell a 
hexokinase nucleic acid sequence. 

70. The method of claim 69, wherein said hexokinase nucleic acid sequence 
comprises a DNA sequence substantially identical to the AtHXKl nucleotide 
sequence that is shown Fig. IF (SEQ ID NO: 3). 
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71. The method of claim 69, wherein said hexokinase nucleic acid sequence 
comprises a DNA sequence substantially identical to the AtHXK2 nucleotide 
sequence that is shown Fig. 1G (SEQ ID NO: 4). 

72. The method according to claim 69, wherein said transgenic plant has an 
increased sensitivity to sugar. 
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1 CAGT6TGAGT AATTTAGATC GGTATTAGAT CCATCTTAGG TTTCTCTAAT 
51 TTCTCTCAAT TCACTCCAAA ATTTTGATTA TTTCTTCTTT CTGGCTTGTC 
101 AATTTTAGTC ATTTGTAATC CTTGCTTTTG CGATCGGAAT CGTAAAAATC 
151 CGATCTTTCT TTTAGATTCG TTTTGTTTTT GATTCCAAAT CGGAAAAATG 
201 GGTAAAGTAG CTGTTGGAGC GACTGTTGTT TGCACGGCGG CGGTTTGTGC 
251 GGTGGCTGTT TTGGTTGTTC GACGACGGAT GCAGAGCTCA GGGAAGTGGG 
301 GACGTGTTTT GGCTATCCTC AAGGCCTTTG AAGAGGATTG TGCGACTCCG 
351 ATCTCGAAAC TGAGACAAGT GGCTGATGCT ATGACCGTTG AGATGCATGC 
401 TGGTCTTGCA TCCGACGGTG GTAGCAAACT CAAGATGCTT ATCAGCTACG 
451 TTGATAATCT TCCTTCCGGG GATGAAAAGG GTCTCTTTTA TGCATTGGAC 
501 CTAGGGGGGA CAAACTTCCG TGTCATGCGT GTGCTTCTTG GCGGGAAGCA 
551 AGAGCGTGTT GTTAAACAAG AATTCGAAGA AGTTTCGATT CCTCCTCATT 
601 TGATGACTGG TGGTTCAGAT GAGTTGTTCA ATTTTATAGC TGAAGCTCTT 
651 GCGAAGTTTG TCGCTACAGA ATGCGAAGAC TTTCATCTTC CAGAAGGTAG 
701 ACAGAGGGAA TTAGGTTTCA CTTTCTCGTT TCCTGTTAAG CAGACTTCTC 
751 TGTCCTCTGG TAGTCTCATC AAATGGACAA AAGGCTTTTC CATCGAAGAA 
801 GCAGTTGGAC AAGATGTTGT TGGAGCACTT AATAAGGCTC TGGAAAGAGT 
851 TGGTCTTGAC ATGCGAATCG CAGCACTTGT TAATGATACC GTTGGAACAC 
901 TAGCCGGTGG TAGATACTAT AACCCGGATG TTGTTGCTGC TGTTATTTTA 
951 GGCACTGGGA CAAACGCAGC CTATGTTGAG CGTGCAACCG CGATCCCTAA 
1001 ATGGCATGGT CTGCTTCCAA AATCAGGAGA AATGGTTATA AACATGGAAT 
1051 GGGGAAACTT CAGGTCATCA CATCTTCCAT TAACCGAGTT TGATCACACG 
1101 CTGGATTTCG AGAGTCTGAA TCCAGGCGAA CAGATTCTTG AGAAAATCAT 
1151 TTCCGGTATG TACTTGGGAG AGATTTTGCG AAGAGTTCTT CTAAAGATGG 
1201 CTGAAGATGC TGCTTTCTTT GGCGATACAG TCCCATCTAA GCTGAGAATA 
1251 CCATTCATCA TTAGGACTCC TCACATGTCG GCTATGCACA ACGACACTTC 
1301 TCCAGACTTG AAGATTGTTG GGAGCAAGAT TAAGGATATA TTGGAGGTCC 
1351 CTACAACTTC TCTGAAAATG AGAAAAGTTG TGATCAGTCT CTGCAACATC 
1401 ATAGCAACCC GAGGAGCTCG TCTCTCTGCT GCTGGAATCT ATGGTATTCT 
1451 GAAGAAACTG GGAAGAGATA CTACTAAAGA CGAGGAGGTG CAGAAATCGG 
1501 TTATAGCCAT GGATGGTGGA TTGTTTGAGC ATTACACTCA GTTTAGTGAG 
1551 TGTATGGAGA GCTCACTAAA AGAGTTGCTT GGAGATGAAG CTTCAGGAAG 
1601 CGTTGAAGTC ACTCACTCCA ATGATGGATC AGGCATTGGA GCTGCGCTTC 
1651 TTGCTGCTTC TCACTCTCTC TACCTTGAAG ACTCTTAAAA CCTACCCAAA 
1701 GAGCGCCATT TTTCGGTAAT TTACTGAAAG CTTTTCGCTA TCAGAAAACG 
1751 CCTAAGCCAA GTTCTAAGGC GTCATAAAAG AAAGCATTCC ATGTTTTTAC 
1801 TCTTCCCCAA GACTTTCTTT GTAGCAAATA AGTTTCCTTG GGAGAAATAT 
1851 TTGTTTTCAT GTTCTTCAAA AATAAAAGAC TCAGTTCTTC AGATTCTGGG 
1901 ATTTTATTAT AACCAGATAT GTTGTAAAAA CTACAAATTC AAAGCTCACT 
1951 TCACTGGAGT TCTGAGTATA TAAAGATTTC ATTTTTCCTA AAAAAAAAAA 
2001 AAAAAACTAA ATTACTCACA CTC AAAAAAAAAA 
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1 CAGTGTGACT AATTTAGATC ATCTCTAGCG TTCTTAAAGT TTCCAACTTT TTTTTTTTAT 
61 TAATTTGGGC CAACTTTTTT GTTTTATTAA TTTGGGCCAA CCTTTTTTGG 1TTGAGAATT 
121 GGGCGAGGGA GAAAGATGGG TAAAGTGGCA GTTGCAACGA CGGTAGTGTG TTCGGTGQCG 
181 GTATGTGCGG CGGCGGCGTT GATAGTACGG AGGAGAATGA AAAGCGCAGG GAAATGGQCA 
241 AGAGTGATAG AGATATTGAA AGCCTTTGAA GAAGATTGTG CAACGCCAAT TGCCAAATTG 
301 AGACAAGTGG CTGATGCTAT GACTGTTGAG ATGCATGCTG GTCTTGCITC TGAAGGTGGC 
361 AGCAAGCTTA AGATGCTTAT TAGCTACGTT GATAATCTTC CTTCTGGGGA TGAGACTGGT 
421 TnTTCTATG CGTTGGATCT AGGCGGAACA AACTTCCGTG TTATGCGTGT GCTTCTTGGT 
481 GGGAAGCACG ACCGTGTTGT TAAACGAGAA TTCAAAGAAG AATCTATTCC TCCTCATTTG 
541 ATGACCGGGA AGTCACATGA ATTATTCGAT TTTATCGTTG ATGTTCTTGC CAACTTTGTC 
601 GCTACAGAAG GCGAGGACTT TCATCTCCCA CCTQGTAGAC AACGGGAACT AGGTTTCACT 
661 TTCTCATTTC CGGTTAAGCA QCTATCTTTA TCCTCTGGCA CTCTCATCAA CTGGACAAAG 
721 GGCTTTTCCA TTGACGATAC AGTTGATAAA GATGTTGTTG GAGAACTTGT TAAAGCTATG 
781 GAAAGAGTTG GGCTGGACAT GCITGTCGCA GCGCTTGTTA ATGATACCAT TGGAACACTT 
841 GCGGGTGGTA GATACACTAA CCCGGATGTC GTTGTCGCAG TTATTTTGGG CACCGGCACA 
901 AATGCAGCCT ATGTCGAACG TGCACATGCA ATTCCCAAAT GGCATGGTTT GCTACCCAAA 
961 TCAGGAGAAA TGGTGATCAA CATGGAATGG GGAAACTTCA GGTCATCACA TCTTCCATTG 
1021 ACAGAGTACG ACCACTCTCT AGATGTCGAT AGTTTGAATC CTGGTGAACA GATTCTTGAG 
1081 AAAATCATTT CCGGAATGTA TCTGGGA6AA ATCTTGCGTA GAGTTCTTCT GAAGATGGCT 
1141 GAAGAAGCTG CCTTCTTTGG CGATATCGTC CCACCTAAGC TGAAAATACC ATTCATCATA 
1201 AGGACCCCCA ACATGTCTGC TATGCACAGT GATACTTCCC CGGATTTGAA GGTTGTAGGA 
1261 AGCAAGTTAA AAGACATATT GGAGGTCCAG ACTAGTTCTC TGAAGATGAG GAAAGTTGTG 
1321 ATCAGCCTAT GTAACATCAT TGCAAGCCGA GGAGCTCGTT TATCTGCTGC GGGGATCTAT 
1381 GGAATCCTCA AGAARATAGG AAGAGACGCA ACAAAAGATG GAGAAGCTCA GAAATCTGTG 
1441 ATAGCGATGG ACGGTGGQCT ATTCGAGCAT TACACTCAGT TCAGTGAGTC GATGAAGAGT 
1501 TCATTGAAAG AGTTGCTTGG AGATGAAGTT TCAGAGAGTG TTGAAGTGAT ACTGTCGAAT 
1561 GATGGTTCAG GTGTTGGAGC TGCATTACTT GCTGCTTCTC ACTCTCAGTA TCTCGAACTT 
1621 GAAGATGACT CTGAAACAAG TTAATTTAAA G CTTTTTTST GTTTAACCTT CTTCTTGTTG 
1681 CGTAGGTTAA CAATAAAAGT AGAGGTAAAT GCCTTTGGGA AATTTTATTT TTGACAA3TT 
1741 TCAGGAACAA TAAAACCTGG ATTCTTCATC AAAGCTCTGG GAAATTCAAA CGACCAGCCA 
1801 ATGTTGTAGA ACTATACATA TATATTCGAG TTCTTTCTAT GAAAAAAAAA AAAAAAAAAA 
1861 AACCTTAAAT TACTCACACT GGC 
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